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DNA DEMETHYLASE, THERAPEUTIC AND 
DIAGNOSTIC USES THEREOF 

BACKGROUND OF THE INVENTION 

(a) Field of the Invention 

The invention relates to a novel enzyme, DNA 
demethylase, therapeutic and diagnostic uses thereof. 

(b) Description of Prior Art 

Many lines of evidence have established that 
modification of cytosine moieties residing in the dinu- 
cleotide sequence CpG in vertebrate genomes is involved 
in regulating a number of genome functions such as 
parental imprinting, X-inactivation, suppression of 
methylation of ectopic genes and differential gene 
expression (Szyf, M . (1996) Pharmacol. Ther. 70, 1-37). 
DNA methylation performs its function of differentially 
marking genes because the distribution of methylated 
CpGs is tissue- and site- specific forming a pattern of 
methylation (Szyf, M. (1996), Pharmacol. Ther. 70, 1- 
37) . It is clear that the pattern of methylation is 
fashioned by a sequence of methylation and demethyla- 
tion events (Brandeis, M. et al . (1993) Bioassays 15, 
709-713) during development and is maintained in the 
fully differentiated cell (Razin, A. et al . (1980) Sci- 
ence 210, 604-610) . While it was originally suggested 
that DNA demethylat ion is accomplished by a passive 
loss of methyl groups during replication (Razin, A. et 
al. (1980) Science 210, 604-610), it is now clear that 
an active process of demethylation occurs in embryonal 
cells (Frank, D. et al . (1991) Nature 352, 239-241), in 
differentiating cell lines (Razin, A. et al - (1986) 
Proc. Natl. Acad. Sci. USA 83, 2827-2831; Szyf, M. et 
al. (1985) Proc. Natl. Acad. Sci. USA 82, 8090-8094) 
and in response to estrogen treatment (Saluz, H.P. et 
al. (1986) Proc. Natl. Acad. Sci. USA 83, 7167-7171) . 
Two modes of demethylation have been documented: site 
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specific deme thylat ion that coincides in many instances 
with onset of gene expression of specific genes and a 
general genome wide demethylat ion that occurs during 
early development in vivo during cellular differentia- 
5 tion and in cancer ceils ( Feinberg , A. P. et al . (1983) 
Nature 301, 89-92; Razin, A. et al . (1986) Proc, Natl. 
Acad. Sci. USA 83, 2827-2831). The global demethyla- 
tion is consistent with the hypothesis that a general 
demethylase activity which is activated at specific 
10 points in development or oncogenesis exists. It has 
been hypothesized that one mechanism regulating the 
pattern of methylation is the control of expression of 
methyltransf erase (Szyf, M. (1991) Biochem. Cell Biol, 
69, 164-167) and demethylase activities (Szyf, M . (1994) 

15 Trends Pharmacol. Sci. 7, 233-238). Although exten- 
sive information has been obtained on the enzymatic 
activity responsible for methylation and the regulation 
of its expression in the last two decades (Szyf, M. 
(1996) Pharmacol. Ther. 70, 1-37), the identity of the 

20 demethylase has remained a mystery. It is clear how- 
ever that to fully understand how patterns of methyla- 
tion are formed and maintained and to determine their 
role in development t physiology and oncogenesis, one 
has to identify the demethylase enzyme (s) . Two main 

25 difficulties have inhibited the identification of this 
enzyme. First, it is believed that demethylat ion of a 
methylated cytosine is chemically highly unlikely since 
it involves breaking a very stable C-C bond. Second, 
demethylation occurs at very defined stages in develop- 

30 ment (Brandeis, M . et al . (1993) Bioassays 15, 709-713) 
and identifying an adequate tissue source for this 
enzyme is critical . 

Whereas no bona fide demethylase has been iden- 
tified to date, alternative biochemical mechanisms 

35 involving exchange of methylated cytosines with non- 
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methylated cytosines have been described. One previ- 
ously proposed mechanism is removal of the methylated 
base by a glycosylase and its replacement with a non- 
methylated nucleotide utilizing an "excision-repair" 

cz r^^^"h^T-i-ioTn /P^^-i-n J\ ^ 1 f 1 Qfifi) Prnr NT^ t~ 1 Z\ n p\ H 

J t t L V V . J. X il J. uui \ J. \- J— * * -» / * * . — — » - \ — — — — / — — — — - — ■ • 

Sci . USA 83, 2827-2831). Glycosylase activities that 
can remove methylated cytosines from DNA have been dem- 
onstrated by Vairapandi and Duker (Vairapandi, M. et 
al. (1993) Nucl. Acids Res. 21, 5323-5327) and more 

10 recently by Jost (Jost, J. P. et al . (1995) J. Biol. 
Chem. 270, 9734-9739). However it is not clear 

whether this activity is responsible for the general 
demethylat ion observed in cellular differentiation. 
The fact that the activity identified by Jost acts spe- 

15 cifically on hemimethylated sequences (which is not the 
natural substrate in most cases) and can remove thymi- 
dines as well as 5 -methyl cytosines , supports a repair 
function for this glycosylase-demethylase (Jost, J. P. 
et al . (1995) J. Biol. Chem. 270, 9734-9739). An 

20 alternative mechanism involving a RNA dependent activ- 
ity has been recently described by Weiss et al . (Weiss 
et al . , 1996). This proteinase-insensitive RNA depend- 
ent activity has been shown to catalyze the excision 
and replacement of a methylated CpG dinucleotide with a 

25 nonmethylated CpG dinucleotide that is contained in a 
DNA -RNA hybrid molecule (Weiss, A. et al . (1996) Cell 
87, 709-718) . This activity which was identified in 
differentiating cells in culture was proposed to be 
involved in demethylat ion during development. These 

30 previous findings demonstrate that the common accepted 
model in the filed has been that a bona fide demethy- 
lase does not exist. 

It has been previously proposed that the exten- 
sive hypomethylat ion observed in cancer cells might be 

35 a consequence of activation of demethylase activity by 
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oncogenic pathways (Szyf, M. (1994) Trends Pharmacol. 
Sci. 7, 233-238 ; Szyf, M. et al . (1995) J. Biol. Chem. 
270, 12690-12696) . In accordance with this hypothesis 
we have shown that ectopic expression of v-Ha-ras had 
5 induced demethyiat ion activity in the cells (Szyf, M . 
et al. (1995) J. Biol. Chem. 27 0, 12 6 90-12696). Using 
an assay that directly measures the conversion of 3 ' 32 P 
labeled methyl dCMP (mdCMP) into dCMP, we have shov^n 
that nuclear extracts prepared from P19-Ras transfec- 

10 tants bear high levels of demethylase activity (Szyf, 
M. et al. (1995) J . Biol. Chem. 270, 12 690-12696). 
Building on this observation, we hypo the si zed that can- 
cer cell lines were a good source for demethylase. 
However, it is not evident that Ras expression in pl9 

15 cells does reflect the situation in cancer cells. P19 
is an embryonic cell and expression of Ras might be 
differentiating them. 

It would be highly desirable to be provided with 
a bona fide DNA demethylase (DNA dMTase) to alter 

20 developmental programs for therapeutic and biological 
use . 



SUMMARY OF THE INVENTION 

In accordance with the present invention, we 
25 demonstrate the purification of a bona fide DNA demeth- 
ylase (DNA dMTase) from a human lung cancer cell line 
A549, determine its kinetic parameters and substrate 
specificity. The DNA dMTase activity identified in 
this study converts methyl -dCMP (mdCMP) residing in the 
3 0 dinucleotide sequence mdCpG into dCMP whereas the 
methyl group is released as a volatile residue which 
was identified to be methanol. The activity is puri- 
fied away from any trace amounts of dCTP , is insensi- 
tive to the DNA polymerase inhibitor ddCTP , is not 
35 affected by the presence of methyl dCTP (mdCTP) in the 
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reaction and does not exhibit exonuclease or glyco- 
sylase activities. The identification of this new 
enzyme points out to new directions in our understand- 
ing of how DNA methylation patterns are formed and 
5 al tereri . 

One aim of the present invention is to provide a 
bona fide DNA demethylase (DNA dMTase) . 

In accordance with the present invention there 
is provided a DNA demethylase enzyme having about 
10 4 0 KDa, and wherein the DNA demethylase enzyme is over- 
expressed in cancer cells and not in normal cells. 

In accordance with the present invention there 
is provided a cDNA encoding human demethylase which 
comprises a sequence set forth in SEQ ID NO : 1 . 
15 In accordance with the present invention there 

is provided two mouse cDNAs homologous to the human 
cDNA , wherein the cDNA encoding mouse, demethylase hav- 
ing a sequence set forth in SEQ ID NOS:5-7. 

In accordance with the present invention there 
20 is provided a different human cDNA which encodes a pro- 
tein homologous to the human demethylase having a 
sequence set forth in SEQ ID NO : 3 . 

In accordance with the present invention there 
is provided the use of the expression of demethylase 
25 cDNAs to alter DNA methylation patterns of DNA in vitro 
in cells or in vivo in humans, animals and in plants. 

The demethylase cDNAs expression may be under 
the direction of mammalian promoters, such as CMV . 

The demethylase cDNAs expression may be under 
30 plant specific promoters to alter methylation in plants 
and to allow for altering states of development of 
plants and expression of foreign genes in plants. 

The demethylase cDNAs expression may be in the 
antisense orientation to inhibit demethylase in cancer 
35 cells for therapeutic processes. 
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The expression of demethylase cDNA in mammalian 
cells may be to alter their differentiation state and 
to generate stem cells for therapeutics, cells for ani- 
mal cloning and to improve expression of foreign genes. 

In accordance with the present invention there 
is provided the use of the expression of demethylase 
cDNAs in bacterial or insect cells for production of 
large amounts of demethylase. 

In accordance with the present invention there 
is provided the use of the expression of demethylase 
cDNAs for the production of protein in vertebrate, 
insect or bacterial or plant cells, such as antibodies 
against demethylase. 

In accordance with the present invention there 
is provided the use of the sequence of demethylase 
cDNAs as a template to design antisense oligonucleo- 
tides and ribozymes . 

In accordance with the present invention there 
is provided the use of the predicted peptide sequence 
of demethylase cDNAs to produce polyclonal or mono- 
clonal antibodies against demethylase. 

In accordance with the present invention there 
is provided the use of expression of cDNAs in two 
hybrid systems in yeast to identify proteins interact- 
ing with demethylase for diagnostic and therapeutic 
purposes . 

In accordance with the present invention there 
is provided the use of expression of cDNAs in bacte- 
rial, vertebrate or insect cells to produce large 
amounts of demethylase for obtaining a x-ray crystal 
structure and for high throughput screening of demethy- 
lase inhibitors for therapeutics and biotechnology. 

In accordance with the present invention there 
is provided a volatile assay for high throughput 
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screening of demethylase inhibitors as therapeutics and 
anticancer agents which comprises the steps of; 

a) using transcribed and translated demethylase 
cDNAs in vitro to convert methyl -cytosine pres- 
ent in methylated DNA samples to cytosine pres- 
ent in DNA and volatilize methyl group; 

b) determining the absence or minute amount of 
volatilize methyl group as an indication of an 
active demethylase inhibitor. 

In accordance with the present invention there 
is provided a volatile assay for the diagnostics of 
cancer in a patient sample which comprises the steps 

Of : 

a) determining demethylase activity in patient sam- 
ples by assaying conversion of methyl -cytosine 
present in methylated DNA to cytosine present in 
DNA and its volatilization as methyl' groups 
released as methanol ; 

b) determining the presence or minute amount of 
volatilized methyl released as methanol groups 
as an indication of cancer in the patient sam- 
ple . 

In accordance with the present invention there 
is provided the use of an antagonist or inhibitor of 
DNA demethylase for the manufacture of a medicament for 
cancer treatment, for restoring an aberrant methylation 
pattern in a patient DNA, or for changing a methylation 
pattern in a patient DNA. 

Such an antagonist is a double stranded oligonu- 
cleotide that inhibits demethylase at a Ki of 50nM, 

such as fc m GC m GC m GC m G] . 

[q^CG^CG^G^u 

The inhibitors include, without limitation an 

anti-DNA demethylase antibody/ an antisense of DNA 

demethylase or a small molecule such as any derivative 
of imidazole. 
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The change of the methylation pattern may acti- 
vate a silent gene. Such an activation of a silent gene 
permits the correction of genetic defect such as found 
for p- thalassemia or sickle cell anemia. 
5 The DMA demethylase of the present "invention may 

be used to remove methyl groups on DNA in vitro such as 
needed for cloning DNA . 

The DNA demethylase of the present invention or 
its cDNAs may be used, for changing the state of dif- 

10 ferentiation of a cell to allow gene therapy, stem cell 
selection or cell cloning. 

The DNA demethylase of the present invention or 
its cDNAs may be used, for inhibiting methylation in 
cancer cells using vector mediated gene therapy. 

15 In accordance with the present invention there 

is provided an assay for the diagnostic of cancer in a 
patient, which comprises determining the level of 
expression of DNA demethylase by either RT-PCT, ELISA 
or volatilization assay of the present invention in a 

20 sample from the patient, wherein overexpression of the 
DNA demethylase is indicative of cancer cells. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. 1A to IB illustrate the purification of 
25 demethylase (DNA dMTase) from human A549 cells; 

Figs. 2A and 2C illustrate that DNA dMTase is a 
protein inhibited by RNA and not by ddCTP, mdCTP; 

Figs. 2B and 2D illustrate the kinetics of DNA 
dMTase activity; 
30 Figs. 3A to 3C illustrate the product of DNA 

dMTase activity is cytosine and it exhibits no exonu- 
clease or glycosylase activity; 

Figs. 4A-4C illustrate the demethylat ion reac- 
tion releases methanol as a volatile residue; 
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Fig. 4D illustrates the transfer of a proton 
from water to regenerate cytosine; 

Figs. 4E-4F illustrate that the volatile product 
is methanol ; 

reaction; 

Figs. 6A-6D illustrate the substrate Specificity 
of DNA dMTase; 

Figs. 7A-7D illustrate chromatographic isolation 
of dMTase from human A549 cells ; 

Figs. 8A-8B illustrate the alignment between the 
MDB domain of MeCP2 and demethylase and the predicted 
amino acid sequence of human demethylase; 

Fig. 8C illustrates the mRNA encoded by demethy- 
lase ; 

Figs. 9A-9F illustrate the cDNA and their pre- 
dicted amino acid of demethylases and homologues of the 
present invention (SEQ ID NOS.1-8); 

Figs. 10A-B illustrate a mammalian expression 
vector of dMTase and in vitro translated dMTase poly- 
peptide ; 

Fig. IOC illustrates that in vitro translated 
DNA dMTase releases volatile methyl residues from meth- 
ylated DNA; 

Fig. 10D illustrates that in vitro translated 
DNA dMTase transform methylated cytosines to cytosines; 

Fig. 11A illustrates that transiently trans- 
fected demethylase releases volatile residues from 

methylated DNA; 

Fig. 11B illustrates the polypeptide expressed 

from transiently transfected demethylase; 

Figs. 11C-11E illustrate that transiently trans- 
fected demethylase transforms methylated cytosines to 
cytosines in a protein dependent manner; 
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Fig. 11F illustrates that the transformation of 
methylated cytosine to cytosine by transiently trans- 
fected demethylase depends on the concentration of sub- 
strate ; 

fected demethylase catalyzes the transfer of a proton 
from tritiated water to regenerate cytosine; 

Fig. 12B illustrates that the cloned demethylase 
releases methanol from methylated DNA; 

Figs. 13A-13C illustrate that the cancer cells 
express demethylase activity whereas normal cells do 
not ; 

Fig. 13D illustrates that demethylase mRNA is 
highly express in cancer cells; 

Fig. 14A illustrates demethylase bacterial ret- 
roviral and mammalian expression vector; 

Fig. 14B illustrates inhibition of demethylase 
activity by a specific inhibitor; 

Fig. 14C illustrates inhibition of tumorigenesi s 
in vitro by an inhibition of demethylase; 

Fig. 15 illustrates inhibition of tumorigenesis 
in cell culture by induced expression of demethylase 
antisense vector; 

Fig. 16 illustrates the inhibition of demethy- 
lase by a small molecule inhibitor imidazole; and 

Fig. 17 illustrates a model for the inhibition 
of cancer growth by an inhibition of demethylase. 



DETAILED DESCRIPTION OF THE INVENTION 

The pattern of methylation is fashioned during 
development by a sequence of methylation and demethyla- 
tion events. The identity of the demethylase has 
remained a mystery and alternative biochemical activi- 
ties have been shown to demethylate DNA but no activity 
that can truly remove methyl groups from DNA has been 
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shown to date. Utilizing human lung carcinoma cells as 
a source for demethylase activity we demonstrate that 
mammalian cells bear a bona fide DNA demethylase (DNA 
dMTase) activity. DNA dMTase transforms methyl-C to C 
5 by catalyzing replacement of the methyl group on the 5 
position of C with a hydrogen derived from water. DNA 
dMTase demethylates both fully methylated and hemimeth- 
ylated DNA, shows dinucleotide specificity and can 
demethylate mdCpdG sites in different sequence con- 
10 texts. This enzyme is different from previously 

described demethylat ion activities: it is proteinase 
sensitive, activated by RNase and releases different 
products . 

DNA dMTase is a novel enzyme showing a new and 

15 unexpected activity that has not been previously 
described in any organism. The finding of a bona fide 
demethylase, points out new directions in our under- 
standing of the biological role of DNA methylation. 

In spite of the fact that it was previously 

20 shown that Ras expression in pl9 cells can induce 
demethylat ion activity. It was not clear whether this 
demethylation activity is indeed a bona fide demethy- 
lase. One would predict that demethylase is present in 
embryonal cells. It was surprising to see that demeth- 

25 ylation activity is present in cancer cells. The find- 
ing of high levels of demethylase in A549 cells is 
indeed an unexpected discovery. 

In accordance with the present invention, it is 
shown and demonstrated that demethylation occurs by 

30 removal of a methyl group from methylated cytosine in 
DNA, that a hydrogen from water replaces the methyl 
group at the 5 ' position, that the resulting methyl 
group reacts with the remaining hydroxyl from water to 
generate methanol which volatilizes (Fig. 4E-F) . Thus, 
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bona fide demethylat ion of DNA involves the following 
reaction : 

CH 3 ~ cytosine - (DNA) +H-OH - demeth ^ ^^ > H-cytosine + CH 3 -OH 

5 The cDNA cloned in accordance with the present 

invention is the demethylase since it can convert 
methyl -cytosines in DNA to cytosines and volatilize the 
methyl groups on DNA when transcribed and translated in 
vitro which are released as methanol. This is a novel 
10 cDNA encoding a biochemical activity that has been not 
described before. 

In accordance with the present invention/ there is 
shown a model for the inhibition of cancer growth by an 
inhibition of demethylase (Fig. 17). 

15 

EXPERIMENTAL PROCEDURES 
Cell Culture 

A549 Lung Carcinoma cells (ATCC : CCL 185) were 
grown in Dulbecco's modified Eagle's medium (with low 

20 glucose) supplemented with 10% fetal calf serum, 2 mM 
glutamine, 10 U/ml cif rof loxacin . Human Skin Fibro- 
blasts #72-213A MRHF were obtained from BioWhit taker , 
Bethesda and were grown in Dulbecco's modified Eagle's 
medium supplement with 2% fetal calf serum, 2 mM gluta- 

25 mine. H446 Lung carcinoma cells (ATCC: HTB 171) was 
grown in RPMI 1640 medium with 5% fetal calf serum. 
Preparation of nuclear extract 

Nuclear extracts were prepared from A549 cul- 
tures at near confluence as previously described (Szyf 

30 et al . , 1991/ Szyf et al.,1995). The cells were tryp- 
sinized, collected and washed with phosphate-buffered 
saline and suspended in buffer A (10 mM Tris, pH 8.0, 
1.5 mM MgCl 2 , 5mM KC1 , 0.5% NP-40) at the concentration 
of 10 9 cells per ml for 10 min. at 4°C. Nuclei were 

35 collected by centrif ugat ion of the suspension at 1000 g 
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for 10 minutes . The nuclear pel let was re suspended in 
buffer A (400 /il) and collected as described in the 
experimental procedures. A nuclear extract was pre- 
pared from the pelleted nuclei by suspending them in 

rr -u,,-F^^^ n /on i-oKjT T-v-to -rsU Q H O ^ %■ nl ^ t H O mM PHTA 

and 0.4 mM NaCl) at the concentration of 3.3xl0 8 nuclei 
per ml and incubating the suspension for 15 min. at 
4°C. The nuclear extract was separated from the 
nuclear pellet by cent rifugat ion at 10,000g for 30 min- 

10 utes. Nuclear extract were stored in -80°C for at least 
two months without loss of activity. 
Chromatography on DEAE-Sephadex 

A freshly prepared nuclear extract (1 ml , 1.1 
mg) was passed through a Microcon™ 100 spin column, the 

15 retainant was diluted to a conductivity equivalent to 
0.2 M NaCl in buffer L and applied onto a DEAE-Sephadex 
column (Pharmacia) (1.0 x 5 cm) that was preequili- 
brated with buffer L (10 mM Tns-HCl, pH 7 . 5 , 10 mM 
MgCl 2 ) containing 0.2 M NaCl at a flow rate of 1 

20 ml/min. The column was then washed with 15 ml of the 
starting buffer (buffer L + 0.2 M NaCl) and proteins 
were eluted with 5 ml of a linear gradient of NaCl 
(0.2-5.0 M) . 0.8 ml fractions were collected and 
assayed for demethylase activity after desalting 

2 5 through a Microcon™ 10 spin column (Ami con) and re sus- 
pension of the retainant in 0.8 ml buffer L. DNA 
demethylase eluted between 2-5.0 M NaCl. 
Chromatography on S-Sepharose 

Active DEAE-Sepharose column fractions were 

30 pooled, adjusted to 0.1 M NaCl by dilution and loaded 
onto an S-sepharose column (Pharmacia) (1.0 x5 cm) 
which had been preequilibrated with buffer L containing 
0.2 M NaCl at a flow rate of 1 ml/min. Following wash- 
ing of the column as described in experimental proce- 

35 dures, the proteins were eluted with 5 ml of a linear 
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NaCl gradient (0.2-5.0M). 0.5 ml fractions were col- 
lected and assayed for DNA demethylase activity after 
desalting and concentrating to 0.2 ml using a Microcon™ 
10 spin column. DNA demethylase activity eluted around 
5 5 . 0 M NaCl . 

Chromatography on Q-Sepharose 

Active fractions from S-sepharose column were 
pooled, adjusted to 0.2 M NaCl by dilution and applied 
onto a Q-sepharose (Pharmacia) column (1.0 x5 cm) which 

10 had been equilibrated as described in the experimental 
procedures at a flow rate of 1 ml/min. The column was 
washed and the proteins were eluted with a linear NaCl 
gradient (0.2- 5.0 M) . Fractions (0.5 ml) were col- 
lected, assayed for demethylase activity after desalt- 

15 ing and concentrating to a final volume of 0.2 ml as 
described in the experimental procedures . The demethy- 
lase activity eluted around 4.8-5.0 M NaCl. 
Gel-Exclusion. Chromatography on DEAE-Sephacel 

The pooled fractions of Q-sepharose column were 

20 adjusted to 0.2 M NaCl, loaded onto a 2.0 x 2.0 cm 
DEAE-Sephacel column (Pharmacia) and eluted with 10 ml 
of buffer L containing 0.2 M NaCl. The fractions (0.8 
ml) were collected and assayed after concentration to 
about 180 /il with a Microcon™ 10 spin column for DNA 
.25 demethylase activity. The activity was detected at 
fraction 4, which is very near the void volume 
(~200kDa) . 

Assay of DNA demethylase activity 

To directly assay DNA demethylase activity in 
3 0 vitro two independent methods were applied. 

(A) To assay the conversion of methyl -dCMP (mdCMP) to 
dCMP we used a previously described method (Szyf et 
al., 1995). Briefly, a 32 P labeled, fully methylated 
poly [mdC 32 PdG] n substrate was prepared as follows. One 
35 hundred ng of a double-stranded fully methylated 
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(mdCpdG) oligomer (Pharmacia) were denatured by boil- 
ing, which was followed by partial annealing at room 
temperature. The complementary strand was extended 
with Klenow fragment (Boehringer Mannheim) using 

5 mecnyr - D - ± r ^ u . jl ulti; \Duciinuyci r^cuumts ctnu, 

[a- 32 P] GTP (100 /iCi, 3000 Ci/mmol), and the unincorpo- 
rated nucleotides were removed by chromatography 
through a NAP-5 column (Pharmacia) . The NAP-5 chroma- 
tography was repeated to exclude minor contamination 

10 with unincorporated nucleotides. As a control a non- 
methylated poly [dC 32 pdG] n substrate was similarly pre- 
pared except that a nonmethylated dCpdG oligomer served 
as a template and dCTP was used in the extension reac- 
tion. The column fractions (30 ill) , described in the 

15 experimental procedures were incubated with 1 ng of 
poly [mdC 32 pdG] n substrate for 1 hour at 3 7 °C in a 
buffer L containing 25% glycerol (v/v) and 5 mM EDTA. 
The reacted DNA as well as a nonmethylated 
poly [dC 32 pdG] n and methylated [mdC 32 pdG] n nonreacted con- 

20 trols were purified by phenol /chloroform extraction and 
subjected to micrococcal nuclease digestion (100 /xg at 
10 /il) and calf spleen phosphodiesterase (2/ig) 
(Boehringer) (Pharmacia) to 3' mononucleotides for 15 
hours at 37°C. The digestion products were loaded onto 

25 a thin layer chromatography plate (TLC) (Kodak, 13255 
Cellulose) , separated in a medium containing, 132 ml 
Isobutyric acid:40 ml water: 4 ml ammonia solution, 
autoradiographed and the intensity of the different 
spots was determined using a phosphorimager (Fuji, BAS 

30 2000 ) . 32 P labeled substrates and tritium labeled sub- 
strates were phosphoimaged using BAS 2000 plate and 
BAS-TR2040 phosphorimager plate respectively. 
(B) The second method determined removal of methylated 
residues from methylated DNA by measuring disappearance 

35 of 3 H-CH 3 or 14 C-CH 3 from the reaction mixture. 100 ng 
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of poly [dCdG]n double stranded DNA was methylated 
using SssI methylase (New England Biolabs) and an 
excess of [ 3 H-methyl AdoMet (80 Ci/mmol ; New England 
Nuclear) ] . The tritiated methyl group containing DNA 
5 was purified from labeled AdoMet using NAP-b column 
chromatography. All column purified fractions of DNA 
demethylase were assayed using the tritiated substrate . 
In a typical assay, 1 ng of DNA was incubated (at a 
specific activity of 4 xl0 6 dpm/mg) wi th 30 fil of column 

10 fraction for one hour at 37°C in buffer L. To deter- 
mine the number of methyl groups remaining in the DNA 
following incubation with the different fractions, 250 
111 of water were added and the mixture was incubated at 
65°C for 5 minutes. One hundred jxl of the reaction 

15 mixture were withdrawn for liquid scintillation count- 
ing. Controls received similar treatment except that 
in place of a column fraction, an equal volume of 
buffer L was added. The number of methyl groups that 
were removed from the DNA by the different fractions 

20 was determined by subtracting the remaining counts in 
each of the fractions from the counts remaining in the 
control. All tests were carried out in triplicates. 
The results are presented as picomole methyl group 
removed. One unit of DNA dMTase activity .is defined 

25 as: amount of enzyme that releases one picomole of 
methyl group from methylated dCpdG substrate in one 
hour at 3 7 °C. 

Methyl removal assay using double- labeled substrates 

To determine whether the methyl group leaves the 
30 DNA and not any non-specific removal of tritium, we 
prepared SK plasmid DNA containing a tritiated hydrogen 
at the 6' position of cytosine and thymidine by growing 
the plasmid harboring bacteria in the presence of deoxy 
[6~ 3 H] Uridine (22 Ci/mmol; Amersham) (10/zCi/ml). The 
35 [6 - 3 H] -cytosine containing pBluescript SK( + ) was puri- 
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fied according to standard protocols and was methylated 
using an excess of [ 14 C-methyl] AdoMet (59 mCi/mmol; 
Amersham) (10 /zCi per 100 jjlI reaction) and SssI methy- 
lase . The double labeled DNA substrate was purified 
5 twice on a NAP -5 column. 15 fil of DNA dMTase were 
incubated with 1 ng of double labeled DNA (specific 
activity of 2000 dpm/ng) for 1 hour at 37°C. Follow- 
ing incubation, the remaining 14 C versus 3 H counts were 
determined as described in the experimental procedures 

10 by scintillation counting (Wallac) . The 14 C counts were 
normalized against 3 H counts. The controls received 
similar treatment except that instead of DNA dMTase, an 
equal amount of distilled water was added to them. 

To determine the number of 3 H-CH 3 in the gaseous 

15 phase, 1 ng of 3 H-CH 3 poly [dCpdG] DNA were incubated 
with DNA dMTase overnight in a sealed tube (Pierce, 
Illinois, USA). 0.8 ml of air were removed from the 
tube using a gas tight syringe (Hamilton, Reno, Nevada) 
and injected into a sealed gas tight scintillation vial 

20 containing 10 ml OptiPhase scintillation fluid (Wallac, 
UK) and counted. As a control the DNA was incubated 
with an equal volume of buffer L and treated similarly. 
Synthesis of other methylated dC dinucleotides 

Poly [mdC 32 pdA] and [mdC 32 pdT] substrates were 

25 prepared as follows. About 0.5 fig of 20 mer oligonu- 
cleotides 5 ' (GG) 103 ' , 5'(GT)103' and 5'(GA)103' were 
boiled and annealed at room temperature with oligonu- 
cleotide 5'CCCCCC3', 5'CACACA3' and 5'CTCTCT3' respec- 
tively. The complementary strand was extended with 

3 0 Klenow fragment using m5dCTP (Boehringer Mannheim) and 
either [a 32 P] dATP (100/zCi, 3000Ci/nmol) or [a 32 P] dTTP 
(100 fiCi, 3000 Ci/mmol) respectively. The unincorpo- 
rated nucleotides were removed by chromatography 
through a NAP-5 column. Hemimethylated mdCpG substrate 

35 was prepared in a similar manner except that a nonmeth- 
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ylated poly dCpdG substrate (Boehringer) was used as 
template and mSdCTP and [a 32 P] dGTP were used for exten- 
sion as described in the experimental procedures. 
Assay for nuclease and glycosylase activity 
5 [ 32 pmdCpdG] n substrate which included a labeled 

32 P 5' to mdC was prepared as follows. About 100 ng of 
poly dCpdG DNA were boiled and partially annealed at 
room temperature. [a 32 P]dCTP and cold dGTP were used for 
complementary strand extension as described in the 
0 experimental procedures . The free nucleotides were 
separated using NAP- 5 column chromatography. The puri- 
fied [ 32 pmdCpdG]n DNA was subjected to methylation by 
SssI methylase using 320 /iM AdoMet . The DNA was repuri- 
fied twice using a NAP- 5 column. The methylated DNA (1 
5 ng) was incubated with either 3 0 jul DNA dMTase, nuclear 
extract or buffer L. To determine whether a 32 P labeled 
residue is excised from the DNA it was directly applied 
(3//1) onto a TLC plate. To determine whether the DNA 
was demethylated it was subjected to digestion with 
snake venom phosphodiesterase (0.2 mg in a lOjxl reac- 
tion volume) (Boehringer Mannheim) which attacks the 
3 '-OH group releasing 5' -mononucleotides . The result- 
ing mononucleotides were separated on TLC plates and 
autoradiographed . 

To test whether dCTP copurifies with DNA dMTase, 
which may be involved in activities other than bona, 
fide demethylation, 20 of dCTP with 1 p.1 of a 32 P 

labeled dCTP (3000 Ci/mmole) was loaded onto the column 
with nuclear extract . The 32 P counts were measured in 
the flow through, washes and in the different frac- 
tions. About 1.1 million counts were loaded onto the 
DEAE-Sepharose column and were all recovered up to 
fraction 8 . 

To determine whether DNA dMTase contains a DNA 
polymerase activity, DNA demethylase reactions were 
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performed in presence of 500 fiM of ddCTP (Pharmacia) or 
500 /uM of mSdCTP (Boehringer Mannheim) at initial rate 
conditions . 

To determine whether DNA dMTase is sensitive to 
RNase or Proteinase K treatment , DNA dMTase was pre- 
treated for 1 h at 56°C with 200 /ig/ml proteinase K 
(Sigma) . A dernethylat ion reaction was carried out with 
this pretreated fraction in the usual manner using both 
dernethylat ion assays described in the experimental pro- 
cedures. To test the effect of RNA digestion on the 
dernethylat ion reaction, the fractions from different 
columns were treated with 100 /zg/ml RNase A (Sigma) . 
Dernethylat ion of pBluescript SK(+) Plasmid 

About 4 fig plasmid pBluescript SK (Stratagene) 
was subjected to methylation using SssI methylase. The 
methylated plasmid (4 ng) was incubated with 30 fil of 
DNA dMTase Fraction 4 of DEAE - Sephacel column under 
standard conditions, extracted with phenol: chloroform 
and precipitated with ethanol . About 1 ng of the plas- 
mid were subjected to digestion with 10 units each of 
either of the restriction endonucleases EcoRII (GIBCO- 
BRL) , Dpnl, Hhal or Hpall (New England Biolabs) before 
and after methylation as well as after DNA dMTase 
treatment in a reaction volume of 10 fil for 2 hour at 
37°C. Following restriction digestion the plasmids 
were extracted with phenol : chloroform, ethanol precipi- 
tated and resuspended in 10 fil . The plasmids were 
elect rophoresed on a 0.8% (w/w) Agarose gel, trans- 
ferred onto a Hybond Nylon membrane and hybridized with 
pBluescript SK(+) plasmid which was 32 P labeled by ran- 
dom-priming (Boehringer Mannheim) . 

Effect of Redox Reagents (NAD, NADH, NADP , NADPH and 
FeCl 3 ) on demethylase activity 

The reagents were prepared at 100 /iM concentra- 
tion and added at a final concentration of 10 jiM to a 
standard methyl removal assay under initial rate condi- 
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tions as described in the experimental procedures. The 
methyl removal activity in presence of each of the 
cof actors was compared to a control DNA dMTase reac- 
tion . 

5 Determination of kinetic parameters 

For determination of kinetic parameters, the 
demethylat ion reactions were performed using both 
assays (generation of dCMP and removal of methyl) as 
described in the experimental procedures except that 
0 varying DNA concentrations from 0 . 1 nM to 2.5 nM were 
used in a total volume of 5 0/il including 30 fil of DNA 
dMTase, Since it has been established by previous 
experiments that the reaction proceeds for at least 3 
hours, the initial velocity of reaction was measured 
5 at one hour intervals. The velocity data was collected 
at each substrate DNA concentration range stated for 
both assays. The Km and Vmax values for DNA demethy- 
lase activity were determined from double reciprocal 
plots of velocity versus substrate concentration. 

Measurements of methanol production catalyzed by 
demethylase by gas chromatography 

Gas chromatography was performed with a Varian™ 
model 3400 GC equipped with a 30m Stabilwax™ column 
(0.053 cm i.d. : Restek Corporation) . Nitrogen™ was 
used as carrier gas at a flow rate of 32 ml/min, the 
injector and detector chambers were at 200 and 300°C 
respectively. The column was maintained at 40°C for 5 
minutes after sample injection. 

The demethylase reaction was performed in eppen- 
dorf tubes kept within sealed scintillation vials with 
300 ul of water as aqueous phase (in radioactive trap- 
ping experiments this was replaced by 300 ul of metha- 
nol). The demethylase reaction was initiated in buffer 
L (10 mM MgCl 2 , 10 mM Tris-HCl pH 8.0) with 500 ng of 
tritiated SK plasmid (6000 dpm/ul) and 100 ul of 
demethylase at 37°C. After overnight incubation at 37°C / 
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the aqueous phase surrounding the eppendorf tube was 
transferred to a fresh eppendorf tube, 2 ul of this 
mixture was injected in the gas chromatography using a 
gas tight syringe (Hamilton, Reno, Nevada) . 
5 Coupled in vitro transcription translation 

The mRNAs encoded by the pcDNA 3.1/His Xpress 
demethylase constructs described above were transcribed 
and translated by coupled transcription- translation 
using Promega™ TNT reticulocyte lysate kit (according 
0 to manufacturer's protocol) , 2 jig of each construct and 
40^uCi of [ 35 ~S] methionine ( 1 , 0 0 OCi/mmol , Amersham) in a 
50/xl reaction volume. To purify non labeled in vitro 
translated demethylase, coupled in vitro transcription 
and translation was performed as above but in the pres- 
5 ence of cold methionine. The translation products were 
bound to a Probond™ nickel column (Invitrogen) and 
demethylase was eluted according to the manufacturer's 
protocol with increasing concentrations of imidazole. 
Demethylase is eluted at 350-500mM imidazole. The imi- 
dazole eluted demethylase was dialyzed and concentrated 
by lyophili zat ion . 

Gas chromatography coupled with Mass spectrometry (GC- 
MS) Analyses for identification of volatile product of 
demethylase catalyzed reaction as methanol 

The demethylation reactions (volume 50 1) were 
run in conical vials having a total internal volume of 
350 microlitres. The vials were closed with a teflon- 
lined screw cap and left at room temperature for 18 h. 
The vials were cooled in an ice bath, opened and 10 mg 
of NaCl and 50 microlitres of toluene were added . The 
vials were frequently shaken over a period of 1 h. The 
toluene phases were pipetted into clean vials in a man- 
ner to rigorously exclude water carry over. Anhydrous 
sodium sulfate (5 mg) was added to the toluene extracts 
to remove water, and the toluene phases were pipetted 
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into autoinjector vials for GC/MS analysis. Aliquots 
of 3 microlitres were analyzed under the following 
instrumental conditions : Instrument ; Hewlett-Packard 
5988A; Column: 30 m x 0.25 mm i.d. fused quartz capil- 
5 iary with 0.2b micron DB-1 liquid phase, programmed 
after an initial hold for 1 min at 70 deg at 5 deg/min 
to 80 deg, then ramped ball ist ically to 280 deg for 
bake-out for 5 min; Injector and interface tempera- 
tures: 250 deg; Helium flow rate 1.5 ml /min; Mass 
10 spectrometer: ion source 200 deg, 70 eV electron impact 
ionization, scanning from m/z 10 to 50 in full scan 
mode was begun 6 s after injection, and ceased at 1,5 
min to avoid acquisition of the intense toluene solvent 
peak . 

15 

Human A549 cells bear a demethylase activity that could 
be purified away from dCTP and DNA MeTase 

The use of an appropriate cellular source and a 

direct assay for demethylase activity are obviously 

20 critical. As we have previously shown that demethylase 
activity was induced in response to ectopic expression 
of the Ras oncogene (Szyf et al . , 1995) we reasoned 
that cancer cells might bear high levels of demethy- 
lase activity. Based on preliminary studies demon- 

25 strating the presence of high levels of demethylase 
activity in the human lung carcinoma cell line A549, we 
have chosen this cell line for our further studies and 
purification steps. Previous studies have used indi- 
rect measures such as increased sensitivity to methyla- 

30 tion-sensitive restriction enzymes as indicators of 
demethylase activity (Weiss et al . , 1996; Jost et al . , 
1995) . To directly measure the conversion of 5-mdCMP 
in DNA to dCMP, we have utilized a completely methyl- 
ated 32 P labeled [mdC 32 pdG]n double stranded oligomer 

35 which we had previously described (Szyf et al., 1995). 
Following incubation with the different fractions, the 
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DNA is purified and subjected to cleavage with microco- 
cal nuclease to 3' mononucleotides. The 3' labeled 
mdCMP and dCMP are separated by thin layer chromatogra- 
phy (TLC) and the conversion of mdCMP to dCMP is 
5 directly determined. This 6. s s ay provides a stringent 
test for bona fide demethylat ion and discriminates it 
from previously described 5mCpC replacement activities 
(Jost et al . , 1995; Weiss et al . , 1996). The glyco- 
sylase-demethylase activity described by Jost et al . 
10 (Jost et al . , 1995) will require the presence of a 
ligase activity and an energy source for replacement of 
mdC with C to be detected by our assay, whereas the 
demethylase activity described by Weiss et al . will not 
be detected since it replaces the intact mdC 32 pdG dinu- 
15 cleotide with a cold dCpdG without altering its state 
of methylation (Weiss et al . , 1996). 

Nuclear extracts were prepared from A549 cells, 
applied onto a DEAE - Sephadex column, eluted with a lin- 
ear gradient from 0.2-5. OM NaCl and the fractions were 
20 assayed for demethylase (dMTase) activity as described 
in the experimental procedures. As shown in Fig. 1(A) 
a clear peak of dMTase activity is eluted at the high 
salt fraction 10. 

Conversion of methylated cytosine to cytosine: 
25 Nuclear extracts prepared from A549 cells (1.1 mg) were 
passed through an AMICON™ 100 spin column. The retain- 
ant (98.56 mg, 0.2 mg/ml) was loaded onto a DEAE-Sepha- 
rose column, the different chromatographic column frac- 
tions eluted by a linear NaCl gradient (0.2-5M) were 
3 0 desalted and (30 jil) incubated with 1 ng of [mdC 32 pdG] n 
double stranded oligomer for 1 hour at 37°C, digested 
to 3' mononucleotides and analyzed on TLC as described 
in the experimental procedures . Control methylated 
(ME) and nonmethylated (NM) [dC 32 pdG] n substrates were 
3 5 digested to 3 ' mononucleotides and loaded on the TLC 
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plate to indicate the expected position of dCMP and 
mdCMP . The active fraction is indicated by an arrow. 
This fraction was loaded on S-Sepharose followed by Q- 
Sepharose and DEAE-Sephacel fractionation. 

The first chromatography step purified the 
dMTase activity from the bulk of nuclear protein 

(Fig. IB) and is a very effective purification step. 

DNA dMTase activity as measured by the release 
of volatile methyl residues. The different column 
fractions were incubated with lng (4 x 10 6 dpm/^g) of 

CE] - CH 3 ~ [mdCpdG] n oligomer and the release of volatile 
methyl residues was determined (-) and presented as 
total dpn) . The results are an average of three inde- 
pendent determinations. Protein concentration was 
determined using the Bio-Rad Bradford kit (-). The 
elution profile of 20 /iM of [ 32 P] -oc-dCTP incubated with 
the protein was determined by scintillation counting of 
the different DEAE fractions (-) and presented as frac- 
tion of dCTP loaded on the column. 

To exclude the possibility that the DNA dMTase 
activity detected in our assay is carried by the DNA 
MeTase, we assayed the fractions for DNA MeTase activ- 
ity using a hemimethylated DNA substrate as previously 
described (Szyf et al . , 1991). As observed in Figure 
IB DNA MeTase activity is detected in the second and 
third fractions, thus our fractionation separated DNA 
dMTase away from the DNA MeTase suggesting that they 
are independent proteins. 

There is a remote possibility that the demeth- 
ylation observed is not a bona fide demethylat ion but 
a consequence of a glycosylase removal of mC, followed 
by removal of the remaining deoxyribose-phosphate by AP 
(apyrimidine) nuclease, repair of the gap catalyzed by 
DNA polymerase using trace dCTP contained in the frac- 
tion and ligation of the break with ligase in the pres- 
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ence of residual ATP. For this hypothesis to be con- 
sistent with our data, four independent enzymes and two 
cof actors have to cof rac t ionate with DNA dMTase, To 
exclude the possibility that a trace amount of dCTP is 
bound to DNA dMTase active fraction; we have added 2 0 
fiH of 32 P labeled dCTP (10x10 s cpm) to the nuclear 
extract and determined its elution profile on the DEAE 
column. Less than background cpm (10 cpm) were 

detected in the DNA dMTase active fraction suggesting 
that our first column purifies dCTP away from the DNA 
dMTase at least IxlO 6 fold (Fig. IB) . If any dCTP is 
present in the nuclear extract, the remaining concen- 
tration after fractionation on DEAE is well below the 
Kms of the known DNA polymerases. The possibility that 
dCTP is so tightly bound to the enzyme that it could 
not be replaced by the exogenous 32 P labeled dCTP is 
very remote since an enzyme using dCTP as substrate 
must readily exchange dCTP . 

The active fraction 10 was further fractionated 
sequentially on the following columns; S-Sepharose and 
Q-Sepharose. The DNA dMTase eluted at the high salt 
fraction from both columns as determined by the 
[mdC 32 pdG] n demethylat ion assay (Fig. 1A) . The ion 
exchange chromatography was followed by chromatography 
on DEAE-Sephacel . 

The fact that we have maintained our activity 
even after 4 fractionation steps (Table 1) and that 
only a single polypeptide is apparent after the last 
purification step argues strongly against the possibil- 
ity that the activity detected in our study is a repair 
or replacement activity. Any replacement mechanism 
must involve a number of proteins and additional cofac- 
tors and substrates. In summary/ the chromatography of 
the demethylase activity in A459 cells provides strong 
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support to the hypothesis that mammalian cells bear a 

bona, fide deme thylase activity. 

DNA dMTase releases a volatile derivative 

A bona, fide demethylat ion has to result in 
5 release of the methyl group as a volatile derivative 
such as C0 2 , methanol, methane or formaldehyde. We 
have therefore incubated a { [ 3 H] -CH 3 -dCpdG}n double 
stranded oligonucleotide with the different column 
fractions and the rate of release of the tritiated 
10 methyl from the aqueous phase was determined by scin- 
tillation counting of the remaining radioactivity in 
the reaction mix. As demonstrated in Fig. lb (dia- 
mond) , the dMTase active fractions release labeled 
methyl groups from the methylated substrate. 

15 

DNA dMTase is a protein which is inhibited by RNA, does 
not involve an exchange activity and does not require 
additional co factors 

DNA dMTase activity measured either as transfor- 

20 mation of mdC to C (Fig. 2a) or as release of volatile 
methyl residues (Fig. 2c) is abolished after proteinase 
K treatment and is not inhibited but rather enhanced 
following RNase treatment, 500 (jlM of ddCTP which 
inhibits DNA polymerase does not inhibit demeth- 

25 ylation of the [mdC32pdG]n substrate, nor is it inhib- 
ited by high concentrations of methyl -dCTP (500 /zM) 
(Fig. 2a), which is consistent with the hypothesis that 
demethylat ion does not involve an excision and replace- 
ment mechanism. If a replacement mechanism is involved 

30 in demethylation, the presence of mdCTP should result 
in incorporation of methylated cytosines and essential 
inhibition of demethylation. Thus, the DNA dMTase 
identified here is a protein and not an RNA and is une- 
quivocally different from the previously published RNA 

35 based or glycosylase based demethylase activities. 
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The DNA dMTase reaction proceeds without any 
requirement for additional substrates such as dCTP, 
redox factors such as NADH and NADPH or energy sources 
such as ATP (data not shown) . As observed in Fig. 2b 
5 and 2d, the DNA dMTase reaction maintains its initial 
velocity up to 90 minutes and continues up to 120 min- 
utes. This time course is inconsistent with dependence 
on enzyme-bound additional nonreplenishable substrates 
such as dCTP or ATP or a nonreplenishable redox factor 
10 such as NADH or NADPH. Exhausting the nonreplenish- 
able substrate or redox factor would have resulted in 
rapid deceleration of the initial velocity. 

A product of the demethylation reaction is deoxyCyto- 
15 sine in DNA 

What is the product of the demethylation reac- 
tion? The results presented above (Fig. la, 2a and b) 
based on a one dimension TLC separation show that DNA 
dMTase generates dC from mdC in DNA. To further sub- 

20 stantiate this conclusion, we subjected DNA dMTase 
treated DNA to remethylat ion with the CpG MeTase M.Sss 
I which can transfer a methyl group exclusively to dC . 
The results presented in Fig. 3a show that the demeth- 
ylated product of DNA dMTase is dC since it is com- 

25 pletely remethylated with M.Sss I. The identity of the 
demethylated product as dC was further established by a 
two-dimension TLC analysis demonstrating that the prod- 
uct of dMTase comigrates with a cold dCMP standard in 
both dimensions (Fig- 3b) . 

3 0 DNA dMTase does not release a nucleotide, a 

phosphorylated base or phosphate from methylated DNA 
when incubated with a [32pmdCpdG]n substrate which 
included a labeled 32P 5' to mdC or our standard meth- 
ylated substrate (Fig.l) where 32P is 3' to the m5dC 

35 (Fig. 3c). Nuclear extracts which obviously contain a 
number of glycosylases and nucleases release phospho- 
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rylated derivatives in the same assay (Fig. 3c) 
dMTase transforms the methyl cytosine in the 
[32pmdCpdG]n substrate to cytosine as demonstrated 
when the reacted DNA is digested to 5' mononucleotides 



_j \r . -1- v rru*D} ctnu. ana x y ^ eu. i^y j. .u^ . ^mce CH1S 

reaction does not involve release of a 32P derivative 
(Fig. 3c -V PDS) , it demonstrates that dMTase trans- 
forms methylated cytosines to cytosines on DNA without 
disrupting the integrity of the DNA substrate by glyco- 
0 sylase or nuclease activity . 

The second product of the dMTase reaction is methanol 

What is the identity of the leaving group? The 
results presented in Figlb suggest that the labeled 
5 methyl leaves the DNA as a volatile compound. The 
demethylase reaction involves release of the methyl 
group per se whereas the cytosine base ring remains in 
the aqueous phase. Fig. 4a demonstrates this point by 
using a methylated plasmid labeled with a 3 H~hydrogen 
) at the sixth position of cytosine and [14C] -methyl at 
the fifth position of cytosine as a substrate. 

The three most obvious candidates the methyl 
group is leaving as are formaldehyde, carbon dioxide, 
and methanol . Methadone trapping for labeled f ormalde- 
5 hyde detection and sodium hydroxide trapping for 
labeled carbon dioxide detection were both negative in 
identifying the form in which the methyl group is leav- 
ing in the dMTase reaction (data not shown) . The other 
possible chemical form that the methyl group may leave 
the DNA as, is methanol. Since methanol is a volatile 
compound, a simple method to measure generation of 
methanol is a scintillation-volatilization assay (see 
Fig. 4b for description) . Volatilization assays have 
been previously used to measure release of methanol in 
demethylat ion reactions. The demethylat ion reaction 
mix containing the labeled { [ 3 H] -CH 3 -dCpdG}n substrate 
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with either dMTase or no enzyme, as a control, is added 
to an uncapped 0.5 ml tube which is placed in a sealed 
scintillation vial containing scintillation fluid. 
Released methanol is volatile, diffuses out of the open 
reaction tube and is mixed with the excess of the scin- 
tillation fluid in the vial registering as counts in 
the scintillation counter. As a control indicating 
that methanol is volatilized under the conditions of 
our assay, we incubated approximately equal counts of 
radioact ively labeled methanol under the same condi- 
tions and measured the counts in a scintillation coun- 
ter at different time points. As observed in Fig. 4c 
the majority of methanol in the reaction tube volatil- 
izes from the reaction tube into the scintillation 
fluid following an overnight incubation at 37°C. The 
experiment shown in Fig. 4b demonstrates that volatil- 
ized label is released from methylated DNA only in the 
presence of dMTase. 

The identity of the volatile group has been 
determined to be methanol by a gas chromatography (GC) 
analysis. The demethylat ion and control reactions 
(indicated in Fig. 4e) were performed in an uncapped 
tube placed in a sealed scintillation vial containing a 
larger volume (300/zl) of water. The volatile residue 
diffuses into the surrounding water and mixes with it. 
A 2 jil sample of the surrounding water was injected 
into a GC column as described in the methods . As 
shown in Fig. 4e, the volatile compound released by 
dMTase in a dose response manner coelutes with metha- 
nol. Release of methanol is observed only in the pres- 
ence of both dMTase and methylated DNA. No methanol is 
released when dMTase is reacted with nonmethylated DNA, 
demonstrating that methanol is a product of demethyla- 
tion of DNA. 
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The leaving group was also identified as metha- 
nol using gas chromatography coupled with Mass spec- 
trometry (GC-MS) . As illustrated in Fig. 4f., incuba- 
tion of methylated DNA with dMTase (dMTase-f ME-DNA) 
5 results in release of a peak with the retention time 
and mass spectrum (peaks are identified at 32 and 29 
atomic mass which are the atomic masses of methanol and 
ionized methanol respectively) which is consistent with 
its identification as methanol. Incubation of dMTase 

10 with nonmethylated DNA does not release methanol indi- 
cating that methanol is a product of the demethylat ion 
reaction. No methanol is released when the samples are 
incubated with dMTase treated with protease K indicat- 
ing that the release of methanol from methylated DNA is 

15 catalyzed by an enzymatic activity. 

Demethylat ion involves transfer of a hydrogen from 
water to regenerate cytosine 

If demethylat ion involves removal of the methyl 

20 moiety from mdC, a hydrogen has to be transferred to 
the carbon at the 5 7 position to regenerate cytosine. 
Since no redox factors are involved, what is the source 
of the hydrogen? To test the hypothesis that the 
source of the hydrogen is water, we incubated either 

25 non labeled [mdCpdG] n or [dCpdG]n double stranded DNA 
with DNA dMTase for different time periods in the 
presence of tritiated water, following which the DNAs 
were digested to 3' dNMPs , separated on TLC with non- 
radioactive standards for each of the 5 possible dNMPs 

30 and exposed to a tritium sensitive phosphorimaging 
plate. As seen in Fig.4d, dMTase catalyzes the trans- 
fer of a tritiated hydrogen from water to dCMP in meth- 
ylated DNA in a time dependent manner only when meth- 
ylated DNA is used as a substrate. Based on the 

35 experiments described in Fig. 3 and 4 we propose that 
dMTase catalyzes the exchange of the methyl group at 
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the 5' position of cytosine in DNA with hydrogen from 
water and the methyl group reacts with the remaining 
hydroxyl group to form methanol (Fig. 5) . 

Substrate and sequence specificity of DNA dMTase 

Methylation of CpG dinucleotides is the most 
characterized modification occurring in genomic 
DNA8,48. The results presented in Fig. 6 demonstrate 
that DNA dMTase is a general DNA dMTase activity that 
demethylates fully or hemimethylated dCpdG in DNA 
flanked by a variety of sequences which are distributed 
at different frequencies, but does not demethylate 
methylated adenines or methylated cytosines that do not 
reside in the dinucleotide CG . First, as shown in 
Fig. 6a, a plasmid DNA methylated in vitro at all dCpdG 
sites with M.Sss I and all d*CdCdGdG sites with M. Msp 
I (which methylates the external C in the sequence 
*CCGG, thus enabling the determination of demethylat ion 
at the CC dinucleotide) and in vivo with the E . coll 
DCM MeTase at dCmdCdA/dTdGdG sites and with the DAM 
MeTase at dGmdAdTdC sites (adenine methylated) was 
treated with dMTase and the state of methylation of the 
plasmid was determined using the indicated methylation 
sensitive restriction enzymes. dMTase demethylates C*G 
methylated sites as indicated by the sensitivity of the 
dMTase treated plasmid to Hpa II and Hha. I but does not 
demethylate C*C,C*A or C*T methylated sites as indi- 
cated by the resistance to Msp I and Eco RII restric- 
tion enzymes, or adenine methylation as indicated by 
its sensitivity to Dpn I. Second, bisulfite mapping 
analysis of methylation of 5 methylated C*G sites 
residing in a M.Sss I in vitro methylated pMetCAT plas- 
mid following dMTase treatment shows that all C*G sites 
are demethylated irrespective of their flanking 
sequences thus excluding the possibility that demeth- 
ylation is limited to CCGG or CGCG sequences (Fig. 6b) . 
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Third, dMTase does not demethylate two fully methylated 
cytosine bearing oligomers [dmC3 2pdAl n , [mdC32pdT]n 
demonstrating that mdCpdA and mdCpdT are not demethyl- 
ated by DNA dMTase (Fig. 6d) . Fourth, dMTase demethyl- 
5 ates a hemimethylated synthetic substrate 

[dCpdG] n* [mdC3 2pdG] n (Fig. 6d) . Demethylat ion of SK is 
complete under these conditions (Fig. 6a) whereas 
demethylat ion of a methylated [mdCpdG] n substrate is 
not complete under the same conditions (Fig. 6d) . This 
10 can reflect differences in the sequence composition of 
the substrate and the frequency of methylated cyto- 
sines . The [mdCpdG] n contains on average 16 fold more 
methylated cytosines per molecule than plasmid DNA. 
Alternatively, these differences might reflect discrep- 

15 ancies in the assays used, restriction enzyme digestion 
versus a nearest neighbor analysis. To address this 
discrepancy we have labeled a fully methylated SK plas- 
mid with [a 32 P]dCTP, 5 -methyl -dCTP and the other dNTPs , 
subjected it to dMTase treatment and digested it to 

20 mononucleotides at different time points following the 
initiation of the reaction and subjected the samples to 
a TLC analysis. As shown in Fig. 6c, the SK plasmid is 
fully demethylated at 3 hours which is consistent with 
the results obtained with methylation sensitive 

25 restriction enzymes (Fig. 6a) . 

The Km of DNA dMTase for hemimethylated and 
fully methylated DNA was determined by measuring the 
initial velocity of the reaction at different concen- 
trations of substrate (Table 2). The calculated Km for 

30 hemimethylated DNA is 6 nM which is two fold higher 
than the Km for DNA methylated on both strands, 2.5-3 
nM (Table 2) . It is unclear yet whether this small 
difference in affinity to the substrate has any sig- 
nificance in a cellular context. Thus similar to the 

35 DNA MeTase DNA dMTase shows dinucleotide sequence 
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selectivity but in difference from DNA MeTase which 
shows preference to hemimethylated substrates dMTase 
prefers fully methylated DNA which is consistent with a 
role for DNA dMTase in altering established methylation 
5 patterns , 

Table 1 
Purification of DNA dMTase 



Purification step Total 

protein 
(Ml) 


Total dpm 


pMoIe/ug 


pMoIe/ng/h 


Fold 
Purification 


Nuclear extract 6000 


1107.2 


5.5 x 10" 5 


1.833 x 10' 5 




DEAE-Sephadex 3.75 


5844 


0.4674 


0.156 


8445.5 


SP-Sepharose 0.77 


5106 


1.989 


0.663 


35939.84 


Q-Sepharose 0.46 


5335 


3.4 


1.13 


62860.65 


DEAE-Sephacei 0.018 


1834 


30.57 


10.19 


552243.2 




Table 2 








Kinetic parameters for 


DNA dMTase 




Method 


(DNA) 




V max (pMole/h) 


Methylated oligo CpG 


2.5 nM 




340 




Hemi-rnethylated CpG 


6.0 nM 




402 




Methylated SK-DNA 


3.3 nM 




40.42 





Cloning and construction of demethylase expression 
vectors 

5 PCR amplification of the MBD domain of the putative 
demethylase candidate cDNA 

One fig of total RNA prepared from the human 
small lung carcinoma cell line A549 was reverse tran- 
scribed using Superscript reverse transcriptase and 
0 random primers (Boehringer) in a 25 fil reaction volume 
according to conditions recommended by the manufacturer 
(GIBCO-BRL) . Five fj.1 of reverse transcribed cDNA were 
subjected to an amplification reaction with Taq poly- 
merase (Promega, 1 unit) using the following set of 
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primers: sense 5 ' CTGGCAAGAGCGATGTC 3' SEQ ID NO ; 9 , 
antisense 5 1 AGTCTGGTTTACCCTTATTTTG 3' SEQ ID NO: 10. 

Amplification conditions were: step 1. 95°C 1 
min. ; step 2: 94°C 0.5 min; step 3: 45°C 0.5 min.; step 
5 4: 72°C 1.5 min; steps 2-4 were repeated 3 0 times. 
MgCl 2 was adjusted to 1 mM according to conditions rec- 
ommended by the manufacturer. The PCR products were 
cloned in pCR2 . 1 vector (InVitrogen) and the sequence 
of the cDNAs was verified by dideoxy- chain termination 

10 method using a T7 DNA sequencing kit (Pharmacia) . The 
amplified fragment was excised from the plasmid with 
EcoRI , labeled with a Boehringer random prime labeling 
kit according to manufacturer's protocol and alpha 32 P- 
dCTP. The labeled probe was used to screen a HeLa cell 

15 cDNA library in ^TriplEx phage (Clontech) according to 
standard procedures. Positive clones were identified 
and further purified by serial dilutions for 4 rounds. 
The insert in the pTriplEx plasmid was excised from the 
phage according to manufacturer's protocols and the 

20 identity of the insert was verified by sequencing. The 
insert was excised by NotI restriction and subcloned 
into either the inducible expression vector: Retro tet 
on (Clontech) in the sense and antisense orientation or 
the pcDNA3.1/His Xpress vector in all three frames and 

25 in the antisense orientation. 

Transfection and expression of demethylase in verte- 
brate cells 

Ten /ig of either Retro tet on demethylase or 
30 pcDNA 3.1/His Xpress demethylase are mixed with 8 fil of 
transfection lypophilic reagent Pfx-2 (Invitrogen) and 
placed upon 100,000 mouse (3T3 Balb/c, human (A549) or 
monkey cells (CV-1) according to manufacturer's proto- 
col in OPTIMEM medium for 4 hours. Cells are harvested 
35 after 48 hours and demethylat ion and demethylase activ- 
ity is determined by measuring total genomic DNA meth- 
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ylation using standard techniques or a cotransf ected in 
vitro methylated plasmid using a Hpall /Mspl restric- 
tion enzyme analysis. Cellular transformation is meas- 
ured by a soft agar assay. 

5 

Demethylation of pBluescript SK(+) Plasmid 

About 4 /ig plasmid pBluescript SK (Stratagene) 
was subjected to methylation using SssI methylase. The 
methylated plasmid (4 ng) was incubated for different 

10 time points as indicated with 30 (xl of DNA dMTase 
Fraction 4 of DEAE-Sephacel™ column under standard con- 
ditions, extracted with phenol: chloroform and precipi- 
tated with ethanol . About 1 ng of the plasmid were 
subjected to digestion with 10 units each of either of 

15 the restriction endonuclease EcoRII (GIBCO-BRL) , Dpnl , 
or Hpall (New England Biolabs) before and after meth- 
ylation as well as after DNA dMTase treatment in a 
reaction volume of 10 /xl for 2 hour at 37°C. Following 
restriction digestion the plasmids were extracted with 

20 phenol : chloroform, ethanol precipitated and resuspended 
in 10 fxl . The plasmids were electrophoresed on a 0.8% 
(w/w) Agarose gel, transferred onto a Hybond™ Nylon 
membrane and hybridized with pBluescript SK(+) plasmid 
which was 32 P labeled by random-priming (Boehringer 

2 5 Mannheim) . 

dMTase activity coelutes with a -45 KDa polypeptide 
when sized under denaturing conditions but migrates as 
a higher molecular weight complex under non denaturing 

30 conditions, dMTase was purified up to 500,000 fold by 
four chromatographic steps (Table 1) . We first deter- 
mined the identity of the polypeptide associated with 
dMTase activity by SDS-PAGE analysis of the active 
fractions. As observed in Fig. 7a, a cluster of 4 

35 polypeptide bands from -44 KDa to 35 KDa coelute with 
dMTase activity in the last two chromatographic steps 
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(the lower fragment might be a degradation product as 
evidenced by its abundance in the later chromatographic 
steps) , However when the active DEAE-Sephacel fraction 
is size fractionated on a 4% non denaturing acrylamide 
5 column, the dMTase art i vitv elutes at the hi ah moiecu- 
lar weight of -170 KDa (Fig. 7c, fraction 63). SDS- 
PAGE analysis of this fraction (63) reveals only two 
bands (Fig. 7b) observed in the active chromatographic 
fractions (Fig. 7a) To further determine whether 

10 dMTase is found in a multimeric complex, fraction 63 
was size fractionated on a glycerol gradient (Fig. 7d) 
and DNA dMTase activity eluted at the -170 kDa range. 
As only two main small polypeptides were identified in 
fraction 63 (approximately 35-43 KDa) , dMTase is proba- 

15 bly found in either a homomeric complex if only one of 
the two peptides is dMTase or a heteromeric complex if 
both polypeptides are associated with dMTase activity. 

a. Identification of a lead DNA cLMTase candidate by 

2 0 homology search of dbEST 

As the purification of dMTase suggests that the 
dMTase is of very low abundance, only -19 ng of dMTase 
could be isolated from 6 mg of nuclear extract 
(Table 1) , we opted for cloning the dMTase based on its 
25 following functional properties. First, since dMTase 
specifically demethylates methylated CG dinucleot ides , 
we assumed that it should bear the ability to recognize 
methylated CG dinucleot ides . Second, the demethylase 
transforms methylated cytosine in DNA to cytosine. 

3 0 Third, the demethylase releases the methyl group as a 

volatile compound . 

Previous reports have shown that proteins inter- 
acting with methylated DNA share a common domain 
(MDBD) . A TBLASTN search of the dbEST database identi- 
35 fied a novel expression tag cDNA (from a T-cell lym- 
phoma Homo Sapiens cDNA 5' end) (gb/AA36 1957/AA3 61957 
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EST71295) and the mouse homologue ( (gb/W97 165/W97165 
mf90g05.rl) from Scares mouse embryo NbME13.5) with 
unknown function that bears homology to the MDBD 
(Fig. 8a) . A search of the GenBank database verified 
5 that it is a novel cDNA that has not been included in 
GenBank. Alignment of the novel EST and MeCP2 and 
MeCPl associated protein has revealed no homology 
beyond the previously characterized MDBD which is con- 
sistent with a different function for this methylated 

10 DNA binding protein. A 201bp fragment bearing the 

sequence identified in the search was reverse tran- 
scribed and amplified from human lung cancer cell line 
A54 9 RNA and was used to screen a cDNA library from 
Hela cells. The largest insert cloned was of 1.36 kb 

15 size and its sequence identity with the EST sequence 
was determined. The cDNA is novel and has no homologue 
in GenBank and no function has ever been assigned to 
it. A virtual translation of the protein identified an 
open reading frame (ORE) of 262 amino acids (Fig. 8b). 

20 The ORF may extend further 5' as no in frame stop codon 
was found upstream of this ATG. However, RACE analy- 
ses and further searches of the dbEST have failed to 
identify 5' sequences upstream to the one identified in 
our screening. 

25 A BLAST search of the candidate protein using 

the Predict protein server against a database of pro- 
tein domain families has identified only the MDBD 
domain and found no homologue to the sequence in the 
data base search. No other functional motifs were 

30 identified by the Prosite analysis. This is consistent 
with a novel biochemical function for this protein. A 
coiled coil prediction of the sequence identified a 
coiled coil domain which is known to play a role in 
protein protein interactions. 
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The identified cDNA encodes an mRNA that is 
widely expressed in human cells as revealed by a North- 
ern blot analysis of human poly A + mRNA (Fig. 8c) as 
one major transcript of ~ 1.6 kb which is close to the 
5 size of the cloned cDNA # verifying that the cloned cDNA 
does not represent a highly repetitive RNA but rather a 
mRNA encoded by a single or low copy number gene. 

In vitro translated candidate cDNA bears dMTase activ- 
10 ity 

A conclusive proof for the existence of a single 
protein that bona fide demethylates DNA is to demon- 
strate that an in vitro translated candidate cDNA can 
volatilize methyl groups from methylated DNA and trans - 

15 form a methyl cytosine to cytosine in an isolated sys- 
tem. The candidate dMTase cDNA was subcloned it into a 
pcDNA3.1/His Xpress ( INVI TROGEN ) expression vector in 
the putative translation frame (pcDNA3.1His A) and in a 
single base frame shift (pcDNA3 . lHis B) , and was in 

20 vitro transcribed and translated in the presence of 
33 S -methionine and the resulting translation products 
were resolved by SDS-PAGE. Autoradiography revealed a 
-40KDa protein (Fig. 10a) . The apparent size of the in 
vitro translated protein is shorter by -3-5 KDa from 

25 the apparent size of the purified protein. The cloned 
cDNA might be missing some upstream amino acids as dis- 
cussed above or might be differently modified in human 
cells . 

Two tests established whether the in vitro 
3 0 translated candidate cDNA is a jbona fide dMTase . We 
first tested whether in vitro translated protein 
(purified on a Ni2+ charged agarose resin) can volatil- 
ize and release methyl residues in [ 3 H] -CH 3 -DNA using a 
radioactive trapping volatilization assay. To verify 
35 that the volatilized counts are true 3 H counts, a spec- 
trum analysis was performed. As demonstrated in Fig. 
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10b no volatilization of tritiated methyl residues is 
observed in the misframe dMTase (misframe) whereas in 
vitro translated putative dMTase cDNA catalyzes the 
volatilization of 3 H-CH 3 residues which are trapped in 
the scintillation cocktail. 

Second, in vitro translated dMTase cDNA trans- 



forms CH 3 -cytosine residing in [ 32 P] -a-dGTP labeled 
plasmid DNA or in [methyl ~dC3 2pdG] n double stranded 
oligomer DNA to cytosine, whereas a frame shift in 
vitro translated dMTase does not demethylate DNA (Fig. 
lOd) . This demonstrates that the dMTase activity is 
dependent on the dMTase translation product and not a 
contaminating activity found in the in vitro transla- 
tion kit that copurifies with the putative dMTase. The 
reaction carried out by the in vitro translated dMTase 
displays: dependence on the dose of in vitro translated 
product (Fig. 10c), time dependence (Fig. lOd) and 
dependence on translated protein (Fig. 10b & d mis- 
frame, Fig. 10c protease K treatment) . Taken together, 
these results strongly suggest that the cDNA cloned 
here codes for a bona fide enzymatic DNA demethylase 
act ivity . 

Transiently transfected dMTase cDNA demethylates DNA 

dMTase cDNA and the pcDNA3 . lHisC vector control 
were transiently transfected into human embryonal kid- 
ney cells to test whether the cDNA can direct expres- 
sion of dMTase activity in human cells. The His-tagged 
proteins were bound to Ni2+ agarose resin and eluted 
from the resin with increasing concentrations of imida- 
zole. The expression of the transfected dMTase was 
verified by a Western blot analysis (Fig. lib). The 
imidazole fractions were assayed for their ability to 
volatilize and release methyl residues in [ 3 H] - CH 3 - DNA 
using a radioactive trapping volatilization assay 1. 
As observed in Fig. 11a, imidazole fractions from 
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dMTase transfected cells volat il i ze [ H] -CH 3 whereas no 
tritiated counts are detected in DNA treated with imi- 
dazole fractions from cells transfected with a misframe 
mutation of dMTase or non transfected cells. The tran- 
5 siently expressed dMTase transforms methylated cytosine 
in DNA to cytosine residing in two different substrates 
(Figs. 11c Sc lid), in a protein dependent manner (Figs. 
11c & lie) , and the reaction displays substrate depend- 
ence and saturability (Fig. llf) . Transiently 

10 expressed dMTase was loaded on a non denaturing glyc- 
erol gradient to determine its native MW. Similar to 
dMTase purified from human cells, cloned and purified 
dMTase activity fractionated at the 160-190 KDa range 
(data not shown) . This is consistent with self asso- 

15 ciation of cloned dMTase possibly mediated by the 
coiled -coil domain . 

Cloned DNA dMTase catalyzes a hydrolysis of 5-methyl- 
cytosine to release methanol 

2 0 We determined the mechanism by which methyl 

residues are released by the cloned dMTase ( from Fig . 
11) and compared it to the purified bona fide dMTase 
activity. Increasing amounts of non labeled [methyl - 
dCpdG] DNA were incubated with either the bona fide 

25 dMTase activity purified from A549 cells or the cloned 
dMTase in the presence of [ 3 H] water for 3 hours fol- 
lowed by digestion to mononucleotides, a thin layer 
chromatography and autoradiography. As Fig. 12a shows, 
both reactions replace the methyl group in 5-methylcy- 

30 tosine with a proton donated from water as indicated by 
the presence of [ 3 H] label in cytosine. 

The identity of the leaving methyl group in the 
demethylation reaction catalyzed by the purified bona 
fide dMTase activity was shown to be methanol. In 

35 order to identify the form that the methyl residue 
leaves as in the demethylation reaction catalyzed by 
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the cloned dMTase an identical gas chromatography/mass 
spectrometry analysis of the reaction products was per- 
formed as inl . Only the properly translated form of 
dMTase (both in vitro translated and transiently trans- 
5 fected and nurified) is able to oroduce -innQ r-.H^ar-it-^T-- 
istic of methanol in a mass spectrometric analysis 
(mass of 32 and 29, Fig. 12b) . These results suggest 
that the demethylat ion reaction catalyzed by the cloned 
dMTase is hydrolysis of the 5 -methyl - cytosine to cyto- 
10 sine and methanol as described for the purified 
dMTasel . 

DNA dMTase activity is undetectable in nontransf ormed 
cells 

The assays for dMTase activity described here 
15 and the cloning of DNA dMTase cDNA enables a study of 
its expression at different cellular states. Global 
hypomethylation ■ of DNA is a common observation in can- 
cer cells. This has been a perplexing observation, 
since DNA MeTase activity is elevated in cancer cells. 
20 Hyperact ivation of DNA MeTase has been proposed to play 
a role in cancer development. This paradox raises 
questions on the proposed role of the elevated levels 
of DNA MeTase in cancer cells. One simple explanation 
that has been previously suggested to resolve this 
25 paradox is that cancer cells express induced levels of 
DNA dMTase. We compared the DNA dMTase activity in 
equal concentrations of DEAE-Sephadex fractionated 
nuclear extracts (fractions 9-10) prepared from a num- 
ber of carcinoma cell lines H446, Colo 205, Hela, and 
30 A549 with a similar preparation from human skin fibro- 
blast cells at initial rate conditions using 
[mdC32pdG]n double stranded oligomer as a substrate. 
As observed in Fig. 13a, whereas DNA dMTase activity is 
readily observed in all carcinoma cell lines, it is 
35 undetectable in nontransf ormed human cells. The 
absence of dMTase activity in human primary cells 
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reflects the situation in vivo since dMTase activity is 
undetectable in preparations from different murine tis- 
sues whereas dMTase activity is present in a murine 
carcinoma cell line P19 that was transfected with the 
5 H-Ras protooncogene,- or human tumors carried as xeno- 
grafts in the same strain of mouse (Fig. la: COLO 205, 
A549. Hela) . These conclusions were verified using the 
radioactive- trapping volatilization assay shown in Fig. 
13c . 

10 Since dMTase mRNA has been detected using a sen- 

sitive poly A+ Northern blot in all normal human tis- 
sues, we tested the hypothesis that the absence of 
detected dMTase activity in normal tissues reflects a 
quantitative difference in DNA dMTase mRNA between nor- 

15 mal tissues and cancer lines. A Northern blot analysis 
and quantification of dMTase mRNA by a slot blot analy- 
sis shown in Fig. 13d using total RNA supports this 
hypothesis. Whereas minute levels of dMTase mRNA are 
detected in normal tissues, high levels of dMTase are 

20 expressed in a murine carcinoma cell line Yl that bears 
a 30 fold amplification of Ha-ras. 

A second DNA ciemethylase dMTase2 identified in human 
and mouse 

cDNA sequences, predicted amino acid sequences, and 
25 GenBank accession numbers of both dMTasel and dMTase2 
from human and mouse are shown. We claim that the high 
level of identity of the two proteins (Figs 9c and e) 
suggests that the two proteins can perform the same 
function, DNA demethylat ion . The N-terminals of 

3 0 dMTasel and dMTase2 contain a Methylated DNA Binding 
Domain (MBD) and near their C-terminals is a coiled- 
coil domain, however the middle portions of the protein 
sequences have no homology to any know structural or 
catalytic motif. Importantly, their middle regions are 
35 still extensively homologous suggesting that the cata- 
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lytic site of the demethylase activity lies in this 
area on both proteins. 

Induced expression of DNA demethylase in the Antisense 
orientation inhibits tumori genes is ex vivo 

5 To test the hypothesis that inhibition of DNA 

dMTase can inhibit tumorigenesis tetracycline inducible 

vectors carrying the human dMTasel cDNA in either the 

sense or antisense orientation were constructed and 

transiently transfected into HEK 293 cells, treated for 

10 4 8 hours either in the presence or absence of doxycy- 
cline (a tetracycline analogue) , selected for the last 
24 hours with puromycin, and then plated on soft agar 
and allowed to grow for seven days. After seven days 
colonies were scored and the data presented clearly 

15 show that doxycycline induced expression of the dMTasel 
cDNA in the antisense orientation reduced colony forma- 
tion (Fig . 15 ) . 

Imidazole is a small molecule inhibitor of DNA 
demethylase activity 

20 A template small molecule, imidazole, was tested 

for the ability to inhibit DNA dMTase activity. In a 
volatilization of radioactive methyl residues assay, 
concentrations from 1/iM to lOmM of imidazole were incu- 
bated in a typical volatilization of radioactive methyl 

25 residues as described above. The graph clearly demon- 
strates a dose dependent inhibition of DNA dMTase 
activity by imidazole, and validates a rationale for 
testing imidazole based molecules as inhibitors of DNA 
dMTase activity (Fig. 16) . 

3 0 Identification of DNA demethylase cDNAs and protein 
sequences 

Fig. 9a illustrates cDNA sequence of human dMTasel (SEQ 
ID NO:l) and its predicted amino acid sequence (SEQ ID 
NO:2), including its Genbank location. Fig. 9b illus- 
3 5 trates cDNA sequence of human dMTase2 (SEQ ID NO : 3 ) and 
its predicted amino acid sequence (SEQ ID NO:4), includ- 
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ing its GenBank location. Fig. 9c illustrates protein 
sequence alignment of human dMTasel and human dMTase2 . 
Fig. 9d illustrates cDNA sequence of mouse dMTasel (SEQ 
ID NO: 5) and its predicted amino acid sequence (SEQ ID 
5 NO • 6 ) ,- including its GenBank location. Fig, Qp illus- 
trates cDNA sequence of mouse dMTase2 (SEQ ID NO: 7) and 
its predicted amino acid sequence (SEQ ID NO : 8 ) , 
including its GenBank location. Fig. 9f illustrates 
protein sequence alignment of mouse dMTasel and mouse 
10 dMTase2. 

While the invention has been described in con- 
nection with specific embodiments thereof, it will be 
understood that it is capable of further modifications 
and this application is intended to cover any varia- 

15 tions, uses, or adaptations of the invention following, 
in general, the principles of the invention and 
including such departures from the present disclosure 
as come within known or customary practice within the 
art to which the invention pertains and as may be 

20 applied to the essential features hereinbefore set 
forth, and as follows in the scope of the appended 
claims . 
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WHAT IS CLAIMED IS : 

1. A DNA demethylase enzyme and/or homologue 
thereof having about 4 0 KDa, and wherein said DNA 
dem.eth_yl3.se enzyme is overexpressed in cancer cells. 

2 . A cDNA encoding a human demethylase which com- 
prises a sequence set forth in SEQ ID NOS : 1 and 3. 

3. A cDNA homologous to the cDNA of claim 2, 
wherein said cDNA encoding mouse demethylase set forth 
in SEQ ID NOS : 5 and 7. 

4. The use of the expression of demethylase cDNA of 
claims 2 or 3 to alter DNA methylation patterns of DNA 
in vitro in cells or in vivo in humans, animals and in 
plant s . 

5. The use of claim 4, wherein said demethylase 
cDNA expression is under the direction of mammalian 
promoters . 

6. The use of claim 5, wherein said promoter is 
CMV. 

7. The use of claim 4, wherein said demethylase 
cDNA expression is under plant specific promoters to 
alter methylation in plants and to allow for altering 
states of development of plants and expression of for- 
eign genes in plants. 

8. The use of claim 4, wherein said demethylase 
cDNA expression is in the antisense orientation to 
inhibit demethylase in cancer cells for therapeutic 
processes . 
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9. The use of claim 9, wherein expression of 
demethylase cDNA in mammalian cells is to alter their 
differentiation state and to generate stem cells for 

theraneut i r.& . npl 1 ,q for- ^n-im^l rlnn-inn t-,^ f- ^ ^.^^ 

expression of foreign genes. 

10. The use of the expression of demethylase cDNA of 
claims 2 or 3 in bacterial or insect cells for produc- 
tion of large amounts of demethylase. 

11. The use of the expression of demethylase cDNA of 
claims 2 or 3 for the production of protein in verte- 
brate, insect or bacterial cells. 

12. The use of claim 11 for producing antibodies 
against demethylase . 

13 . The use of the sequence of demethylase cDNA of 
claim 2 as a template to design antisense oligonucleo- 
tides and ribozymes . 

14 . The use of the predicted peptide sequence of 
demethylase cDNA of claim 2 to produce polyclonal or 
monoclonal antibodies against demethylase. 

15. The use of expression of cDNA of claim 2 or 3 in 

two hybrid systems in yeast to identify proteins inter- 
acting with demethylase for diagnostic and therapeutic 
purposes . 

16 . The use of expression of cDNA of claim 2 or 3 in 

bacterial , vertebrate or insect cells to produce large 
amounts of demethylase for high throughput screening of 
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demethylase inhibitors for therapeutics and biotechnol- 
ogy and for obtaining the x-ray crystal structure. 



17. A volatile assay for high throughput screening 

of demethylase inhibitors as therapeutics and antican- 
cer agents which comprises the steps of: 

a) using transcribed and translated demethylase 
cDNA of claim 2 or 3 in vitro to convert methyl - 
cytosine present in methylated DNA samples to 
cytosine present in DNA and volatilize methyl 
group ; 

b) determining the absence or minute amount of 
volatilize methyl group as an indication of an 
active demethylase inhibitor. 



18. A volatile assay for the diagnostics of cancer 

in a patient sample which comprises the steps of: 

a) determining demethylase activity in patient sam- 
ples by determining conversion of methyl -cyto- 
sine present in methylated DNA to cytosine pres- 
ent in DNA and volatilization of the methyl 
group released as methanol; 

b) determining the presence or minute amount of 
volatilized methyl group as an indication of 
cancer in said patient sample. 



19. Use of an antagonist or inhibitor of DNA demeth- 

ylase of claim 1 or 2 for the manufacture of a medica- 
ment for cancer treatment, for restoring an aberrant 
methylation pattern in a patient DNA, or for changing a 
methylation pattern in a patient DNA. 



20. Use according to claim 19, wherein said antago- 

nist is a double stranded oligonucleotide that inhibits 
demethylase at a Ki of 50nM. 
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21. Use according to claim 20, wherein said oligonu- 

cleotide is f ' C m GC m GC m GC m G) 

[G m CG m CG m CG m Gjn 



22. Use according to claim 19, wherein the inhibitor 

comprises an anti-DNA demethylase antibody or an 
antisense oligonucleotide of DNA demethylase or a small 
molecule . 



23. Use according to one of claims 19 or 22, wherein 

the change of the methylation pattern activates a 
silent gene . 



24. Use according to claim 23, wherein the activa- 

tion of a silent gene permits the correction of genetic 
de f ect . 



25. Use according to claim 24, wherein said genetic 
defect is p- thalassemia or sickle cell anemia. 

26. Use of the demethylase of claim 1, for removing 
methyl groups on DNA in vitro. 



27 . Use of the demethylase of claim 1 or its cDNA of 

claim 2, for changing the state of differentiation of a 
cell to allow gene therapy, stem cell selection or cell 
cloning . 



28. Use of the demethylase of claim 1 or its cDNA, 

of claim 2 for inhibiting methylation in cancer cells 
using vector mediated gene therapy. 



29. An assay for the 

patient, which comprises 
expression of DNA demethyl 



diagnostic of cancer in a 
determining the level of 
ase of claim 1 in a sample 
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from said patient, wherein overexpression of said DNA 
demethylase is indicative of cancer cells. 
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SEQUENCE LISTING 

<110> McGILL. UNIVERSITY 
. SZYF, Moshe 

BHATTACHARYA, San joy K. 
RAMCHANDANI , Shyam 



<12 0> DNA DEMETHYLASE, THERAPEUTIC AND 
DIAGNOSTIC USES THEREOF 

<130> 1770-183 "PCT" FC/ld 

<15 0> CA 2,220,805 
<151> 1997-11-12 

<150> CA 2,230, 991 
<151> 1998-05-11 

<160> 10 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 1804 
<212> DNA 
<213> Unknown 

<400> 1 

ccgctctgcg ggcggggcgg gtctccggga ttccaagggc tcggttacgg aagaagcgca 60 

gagccggctg gggagggggc tggatgcgcg cgcacccggg gggaggccgc tgctgcccgg 120 

agcaggagga gggggagagc gcggcgggcg gcagcggcgc tggcggcgac tccgccatag 180 

agcagggggg ccagggcagc gcgctcgctc cgtccccggt gagcggcgtg cgcagggaag 24 0 

gcgctcgggg cggcggccgt ggccgggggc ggtggaagca ggcggcccgg ggcggcggcg 300 

tctgtggccg tggccgtggc cgtggccggg gtcggggccg tggccggggc cggggccggg 360 

gccgcggccg tccccagagt ggcggcagcg gccttggcgg cgacggcggc ggcggcgcgg 420 

gcggctgcgg cgtcggcagc ggtggcggcg tcgccccccg gcgggatcct gtccctttcc 480 

cgtcggggag ctcggggccg gggcccaggg gaccccgggc cacggagagc gggaagagga 54 0 

tggactgccc ggccctcccc cccggatgga agaaggagga agtgatccga aaatcagggc 600 

tcagtgctgg caagagcgat gtctactact tcagtccaag tggtaagaag ttcagaagta 660 

aacctcagct ggcaagatac ctgggaaatg ctgttgacct tagcagtttt gacttcagga 720 

ccggcaagat gatgcctagt aaattacaga agaacaagca gagactccgg aatgaccccc 780 

tcaatcagaa caagggtaaa ccagacctga acacaacatt gccaattaga caaactgcat 840 

caattttcaa gcaaccagta accaaattca cgaaccaccc gagcaataag gtgaagtcag 900 

acccccagcg gatgaatgaa caaccacgtc agcttttctg ggagaagagg ctacaaggac 960 

ttagcgcatc agatgtaaca gaacaaatta taaaaaccat ggagctacct aaaggtcttc 1020 

aaggagtcgg tccaggtagc aatgacgaga cccttctgtc tgctgtggcc agtgctttac 1080 

acacaagctc tgcgcccatc acaggacaag tctctgctgc cgtggaaaag aaccctgctg 1140 

tttggcttaa cacatctcaa cccctctgca aagctttcat tgttacagat gaagacatta 1200 

ggaaacagga agagcgagtc caacaagtac gcaagaaact ggaggaggca ctgatggccg 1260 

acatcctgtc ccgggctgcg gacacggagg aagtagacat tgacatggac agtggagatg 132 0 

aggcgtaaga atatgatcag gtaactttcg actgaccttc cccaagagca aattgctaga 1380 

aacagaatta aaacatttcc actgggtttc gcctgtaaga aaaagtgtac ctgagcacat 144 0 

agctttttaa tagcactaac caatgccttt ttagatgtat ttttgatgta tatatctatt 1500 

attccaaatg atgtttattt tgaatcctag gacttaaaat gagtctttta taatagcaag 1560 

cagggccctt ccggtgcagt gcagctttga ggccaggtgc agtctactgg aaaggtagca 1620 

cttacgtgaa atatttgttt cccccacagt tttaatataa acagatcagg agtaccaaat 1680 
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aagtttccca attaaagatt attatacttc actgtatata aacagatttt tatactttat 1740 
tgaaagaaga tacctgtaca ttcttccatc atcactgtaa agacaaataa atgactatat 1800 
tcac 1804 

<210> 2 
<211> 411 
<212> PRT 
< 2 1 3 > Unknown 

<400> 2 

Met Arg Ala His Pro Gly Gly Gly Arg Cys Cys Pro Glu Gin Glu Glu 

15 10 15 

Gly Glu Ser Ala Ala Gly Gly Ser Gly Ala Gly Gly Asp Ser Ala lie 

20 25 30 

Glu Gin Gly Gly Gin Gly Ser Ala Leu Ala Pro Ser Pro Val Ser Gly 

35 40 45 

Val Arg Arg Glu Gly Ala Arg Gly Gly Gly Arg Gly Arg Gly Arg Trp 

50 55 60 

Lys Gin Ala Gly Arg Gly Gly Gly Val Cys Gly Arg Gly Arg Gly Arg 
65 70 75 80 

Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg 

85 90 95 

Pro Pro Ser Gly Gly Ser Gly Leu Gly Gly Asp Gly Gly Gly Cys Gly 

100 105 110 

Gly Gly Gly Ser Gly Gly Gly Gly Ala Pro Arg Arg Glu Pro Val Pro 

115 120 125 

Phe Pro Ser Gly Ser Ala Gly Pro Gly Pro Arg Gly Pro Arg Ala Thr 

130 135 140 

Glu Ser Gly Lys Arg Met Asp Cys Pro Ala Leu Pro Pro Gly Trp Lys 
145 150 155 160 

Lys Glu Glu Val lie Arg Lys Ser Gly Leu Ser Ala Gly Lys Ser Asp 

165 170 175 

Val Tyr Tyr Phe Ser Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gin 

180 185 190 

Leu Ala Arg Tyr Leu Gly Asn Thr Val Asp Leu Ser Ser Phe Asp Phe 

195 200 205 

Arg Thr Gly Lys Met Met Pro Ser Lys Leu Gin Lys Asn Lys Gin Arg 

210 215 220 

Leu Arg Asn Asp Pro Leu Asn Gin Asn Lys Gly Lys Pro Asp Leu Asn 
225 230 235 240 

Thr Thr Leu Pro lie Arg Gin Thr Ala Ser lie Phe Lys Gin Pro Val 

245 250 255 

Thr Lys Val Thr Asn His Pro Ser Asn Lys Val Lys Ser Asp Pro Gin 

260 265 270 

Arg Met Asn Glu Gin Pro Arg Gin Leu Phe Trp Glu Lys Arg Leu Gin 

275 280 285 

Gly Leu Ser Ala Ser Asp Val Thr Glu Gin He He Lys Thr Met Glu 

290 295 300 

Leu Pro Lys Gly Leu Gin Gly Val Gly Pro Gly Ser Asn Asp Glu Thr 
305 310 315 320 

Leu Leu Ser Ala Val Ala Ser Ala Leu His Thr Ser Ser Ala Pro He 

325 330 335 

Thr Gly Gin Val Ser Ala Ala Val Glu Lys Asn Pro Ala Val Trp Leu 

340 345 350 

Asn Thr Ser Gin Pro Leu Cys Lys Ala Phe He Val Thr Asp Glu Asp 
355 360 365 
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lie Arg Lys Gin Glu Glu Arg Val 

370 375 

Glu Ala Leu Met Ala Asp lie Leu 
385 390 

Met Asp lie Glu Met Asp Ser Gly 

405 



Gin Gin Val Arg Lys Lys Leu Glu 

380 

Ser Arg Ala Ala Asp Thr Glu Glu 
395 400 
Asp Glu Ala 
410 



<211> 1589 

<212> DNA 

<2 13 > Unknown 



<400> 3 

cacgcgcggg cgggtgggcg gagcggcccc cctagcgggg gctgtgaagc gcggggaggg 6 0 

ggccgagcgg gtggcgaagc cggcgcgcgc ccggctgggg gcggagggcg gaggcccgtg 12 0 

ggacagaaca gctgcggcga gtggcggcgg cggagggagc cgaatcggcg acgagcccgg 180 

gggtcgcaac ttgcagaagc ggcggcggcg gcggcatcgg ccacggcggg cggaaaagcc 240 

9999 c 9 caa t ggagcggaag aggtgggagt gcccggcgct cccgcagggc tgggaaaggg 300 

aagaagtgcc caggaggtcg gggctgtcgg ccggccacag ggatgtcttt tactatagcc 360 

ccagcgggaa gaagttccgc agcaagccac aactggcacg ttacctgggc ggatccatgg 420 

acctcagcac cttcgacttc cgcaccggaa agatgttgat gaacaagatg aataagagtc 480 

gccagcgtgt gcgctatgat tcttccaacc aggtcaaggg caagcctgac ctgaacaccg 540 

cgctgcctgt acggcagact gcatccatct tcaagcaacc ggtgaccaag atcaccaacc 600 

accccagcaa caaggtcaag agcgacccgc agaaggcagt ggaccagccg aggcagcttt 660 

tctgggagaa gaagctaagt ggattgagtg cctttgacat tgcagaagaa ctggtcagga 720 

ccatggactt gcccaagggc ctgcagggag tgggccctgg ctgtacagat gagacgctgc 780 

tgtcagccat tgcgagtgct ctacacacca gcaccctgcc cattacaggc cagctctctg 840 

cagccgtgga gaagaaccct ggtgtgtggc tgaacactgc acagccactg tgcaaagcct 900 

tcatggtgac agatgacgac atcaggaagc aggaggagct ggtacagcag gtacggaagc 960 

gcctggagga ggcactgatg gccgacatgc tagctcatgt ggaggagctt gcccgagacg 1020 

gggaggcacc actggacaag gcctgtgcag aggaggaaga ggaggaggaa gaggaggagg 108 0 

aagagccgga gccagagcga gtgtagcaca ggtgccctgc ccaagtctgg gctgcagact 114 0 

gccttcagcc ttgcctggac caggtagggg ccagacctgt aggaggcagc cgtccacctc 1200 

ctttccaaag cctcctgctt ccaggtctca gtgcagggag cccctgtgga ccttgaactc 1260 

acttgtccct gcgctgcctg gcaggaagcc ccacactgaa agcagatgag cagtgaccca 1320 

actgagaggc cacctggaca cagtcacctc cctgcctcct tatcatagga caaggccttg 1380 

cttggcaccg aggagctggg agccgtgttg ggtgctggag gaagtttctg gaaacacacc 1440 

tggctatgcc caccttatgt ccctaaggct attacaggcc agggtttgga ctgctccggc 1500 

ccacagggct gcccagcctc cccacactga gggtcagcag cccaccagga agtcactttc 1560 

cttcaataaa ctgatggtag gaacttgtg 1589 



<210> 4 
<211> 291 
<212> PPT 
<213> Unknown 



<400> 4 

Met Glu Arg Lys Arg Trp Glu Cys Pro Ala Leu Pro Gin Gly Trp Glu 

15 10 15 

Arg Glu Glu Val Pro Arg Arg Ser Gly Leu Ser Ala Gly His Arg Asp 

20 25 30 

Val Phe Tyr Tyr Ser Pro Ser Gly Lys Lys Phe Arg Ser Lys Pro Gin 

35 40 45 

Leu Ala Arg Tyr Leu Gly Gly Ser Met Asp Leu Ser Thr Phe Asp Phe 

50 55 60 

Arg Thr Gly Lys Met Leu Met Ser Lys Met Asn Lys Ser Arg Gin Arg 
65 70 75 80 
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Val Arg Tyr Asp 

Thr Ala Leu Pro 

100 

Thr Lys lie Thr 
115 

Lys Ala Val Asp 
130 

Gly Leu Asn Ala 
145 

Leu Pro Lys Gly 

Leu Leu Ser Ala 

180 

Thr Gly Gin Leu 
195 

Asn Thr Thr Gin 
210 

lie Arg Lys Gin 
225 

Glu Ala Leu Met 

Asp Gly Glu Ala 

260 

Glu Asp Glu Glu 
275 

Glu His Val 
290 



Ser Ser Asn Gin 
85 

Val Arg Gin Thr 

Asn His Pro Ser 

120 

Gin Pro Arg Gin 

w w 

Phe Asp He Ala 
150 

Leu Gin Gly Val 
165 

He Ala Ser Ala 

Ser Ala Ala Val 

200 

Pro Leu Cys Lys 
215 

Glu Glu Leu Val 
230 

Ala Asp Met Leu 
245 

Pro Leu Asp Lys 

Glu Glu Glu Glu 

280 



Val Lys Gly Lys 
90 

Ala Ser He Phe 
105 

Asn Lys Val Lys 

Leu Phe Trp Glu 

Glu Glu Leu Val 
155 

Gly Pro Gly Cys 
170 

Leu His Thr Ser 
185 

Glu Lys Asn Pro 

Ala Phe Met Val 

220 

Gin Gin Val Arg 
235 

Ala His Val Glu 
250 

Ala Cys Ala Glu 
265 

Glu Pro Asp Pro 



Pro Asp Leu Asn 
95 

Lys Gin Pro Val 
110 

Ser Asp Pro Gin 
125 

Lys Lys Leu Ser 

Lys Thr Met Asp 

160 

Thr Asp Glu Thr 
175 

Thr Met Pro He 
190 

Gly Val Trp Leu 
205 

Thr Asp Glu Asp 

Lys Arg Leu Glu 

240 

Glu Leu Ala Arg 
255 

ASp Asp Asp Glu 
270 

Asp Pro Glu Met 
285 



<210> 5 
<211> 1966 
<212> DNA 
<213> Unknown 



<400> 5 

99999 c 9tgg ccccgagaag gcggagacaa gatggccgcc catagcgctt ggaggaccta 60 

a 9 a 99 c 99tg gccggggcca cgccccgggc aggagggccg ctctgtgcgc gcccgctcta 120 

tgatgcttgc gcgcgtcccc cgcgcgccgc gctgcgggcg gggcgggtct ccgggattcc 180 

aagggctcgg ttacggaaga agcgcagcgc cggctgggga gggggctgga tgcgcgcgca 24 0 

cccgggggga ggccgctgct gcccggagca ggaggagggg gagagtgcgg cgggcggcag 300 

cggcgctggc ggcgactccg ccatagagca ggggggccag ggcagcgcgc tcgccccgtc 3 60 

cccggtgagc ggcgtgcgca gggaaggcgc tcggggcggc ggccgtggcc gggggcggtg 420 

gaagcaggcg ggccggggcg gcggcgtctg tggccgtggc cggggccggg gccgtggccg 4 80 

gggacgggga cggggccggg gccggggccg cggccgtccc ccgagtggcg gcagcggcct 540 

tggcggcgac ggcggcggct gcggcggcgg cggcagcggt ggcggcggcg ccccccggcg 600 

ggagccggtc cctttcccgt cggggagcgc ggggccgggg cccaggggac cccgggccac 6 60 

ggagagcggg aagaggatgg attgcccggc cctccccccc ggatggaaga aggaggaagt 72 0 

gatccgaaaa tctgggctaa gtgctggcaa gagcgatgtc tactacttca gtccaagtgg 7 80 

taagaagttc agaagcaagc ctcagttggc aaggtacctg ggaaatactg ttgatctcag 840 

cagttttgac ttcagaactg gaaagatgat gcctagtaaa ttacagaaga acaaacagag 900 

actgcgaaac gatcctctca atcaaaataa gggtaaacca gacttgaata caacattgcc 960 

aattagacaa acagcatcaa ttttcaaaca accggtaacc aaagtcacaa atcatcctag 1020 

taataaagtg aaatcagacc cacaacgaat gaatgaacag ccacgtcagc ttttctggga 1080 

gaagaggcta caaggactta gtgcatcaga tgtaacagaa caaattataa aaaccatgga 1140 

actacccaaa ggtcttcaag gagttggtcc aggtagcaat gatgagaccc ttttatctgc 1200 

tgttgccagt gctttgcaca caagctctgc gccaatcaca gggcaagtct ccgctgctgt 1260 

ggaaaagaac cctgctgttt ggcttaacac atctcaaccc ctctgcaaag cttttattgt 1320 
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cacagatgaa gacatcagga aacaggaaga gcgagtacag caagtacgca agaaattgga 1380 

agaagcactg atggcagaca tcttgtcgcg agctgctgat acagaagaga tggatattga 1440 

aatggacagt ggagatgaag cctaagaata tgatcaggta actttcgacc gactttcccc 1500 

aagrgaaaat tcctagaaat tgaacaaaaa tgtttccact ggcttttgcc tgtaagaaaa 1560 

aaaatgtacc cgagcacata gagcttttta atagcactaa ccaatgcctt tttagatgta 1620 

tttttgatgt atatatctat tattcaaaaa atcatgttta ttttgagtcc taggacttaa 1680 

aattagtctt ttgtaatatc aagcaggacc ctaagatgaa gctgagcttt tgatgccagg 1740 

tgcaatctac tggaaatgta gcacttacgt aaaacatttg ttccccccac agttttaata 1800 

agaacagatc aggaattcta aataaatttc ccagttaaag attattgtga cttcactgta 1860 

tataaacata tttttatact ttattgaaag gggacacctg tacattcttc catcatcact 1920 

gtaaagacaa ataaatgatt atattcacaa aaaaaaaaaa aaaaaa 1966 

<210> 6 

<211> 414 

<212> PRT 

<213> Unknown 

<400> 6 

Met Arg Ala His Pro Gly Gly Gly Arg Cys Cys Pro Glu Gin Glu Glu 

15 10 15 

Gly Glu Ser Ala Ala Gly Gly Ser Gly Ala Gly Gly Asp Ser Ala lie 

20 25 30 

Glu Gin Gly Gly Gin Gly Ser Ala Leu Ala Pro Ser Pro Val Ser Gly 

35 40 45 

Val Arg Arg Glu Gly Ala Arg Gly Gly Gly Arg Gly Arg Gly Arg Trp 

50 SB 60 

Lys Gin Ala Ala Arg Gly Gly Gly Val Cys Gly Arg Gly Arg Gly Arg 
65 70 75 80 

Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg Gly Arg 

85 90 95 

Pro Gin Ser Gly Gly Ser Gly Leu Gly Gly Asp Gly Gly Gly Gly Ala 

100 105 110 

Gly Gly Cys Gly Val Gly Ser Gly Gly Gly Val Ala Pro Arg Arg Asp 

115 120 125 

Pro Val Pro Phe Pro Ser Gly Ser Ser Gly Pro Gly Pro Arg Gly Pro 

130 135 140 

Arg Ala Thr Glu Ser Gly Lys Arg Met Asp Cys Pro Ala Leu Pro Pro 
145 150 155 160 

Gly Trp Lys Lys Glu Glu Val lie Arg Lys Ser Gly Leu Ser Ala Gly 

165 170 175 

Lys Ser Asp Val Tyr Tyr Phe Ser Pro Ser Gly Lys Lys Phe Arg Ser 

180 185 190 

Lys Pro Gin Leu Ala Arg Tyr Leu Gly Asn Ala Val Asp Leu Ser Ser 

195 200 205 

Phe Asp Phe Arg Thr Gly Lys Met Met Pro Ser Lys Leu Gin Lys Asn 

210 215 220 

Lys Gin Arg Leu Arg Asn Asp Pro Leu Asn Gin Asn Lys Gly Lys Pro 
225 230 235 240 

Asp Leu Asn Thr Thr Leu Pro lie Arg Gin Thr Ala Ser lie Phe Lys 

245 250 255 

Gin Pro Val Thr Lys Phe Thr Asn His Pro Ser Asn Lys Val Lys Ser 

260 265 270 

Asp Pro Gin Arg Met Asn Glu Gin Pro Arg Gin Leu Phe Trp Glu Lys 

275 280 285 

Arg Leu Gin Gly Leu Ser Ala Ser Asp Val Thr Glu Gin lie lie Lys 
290 295 300 
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Thr Met Glu Leu Pro Lys Gly Leu Gin Gly Val Gly Pro Gly Ser Asn 

305 310 315 320 

Asp Glu Thr Leu Leu Ser Ala Val Ala Ser Ala Leu His Thr Ser Ser 

325 330 335 

Ala Pro lie Thr Gly Gin Val Ser Ala Ala Val Glu Lys Asn Pro Ala 

340 345 350 

Val Trp Leu Asn Thr Ser Gin Pro Leu Cys Lys Ala Phe lie Val Thr 

7 c; c; -i £rt 7 £ c; 

W W — ' W W <W W W -mj 

Asp Glu Asp lie Arg Lys Gin Glu Glu Arg Val Gin Gin Val Arg Lys 

370 375 380 

Lys Leu Glu Glu Ala Leu Met Ala Asp lie Leu Ser Arg Ala Ala Asp 
385 390 395 400 

Thr Glu Glu Val Asp lie Asp Met Asp Ser Gly Asp Glu Ala 

405 410 

<210> 7 
<211> 2392 
<212> DNA 
<2 13 > Unknown 

<400> 7 

agcgggccga ggagccgggc gcaatggagc ggaagaggtg ggagtgcccg gcgctcccgc 60 

agggctggga gagggaagaa gtgcccagaa ggtcggggct gtcggccggc cacagggatg 12 0 

tcttttacta tagcccgagc gggaagaagt tccgcagcaa gccgcagctg gcgcgctacc 180 

tgggcggctc catggacctg agcaccttcg acttccgcac gggcaagatg ctgatgagca 240 

agatgaacaa gagccgccag cgcgtgcgct acgactcctc caaccaggtc aagggcaagc 3 00 

ccgacctgaa cacggcgctg cccgtgcgcc agacggcgtc catcttcaag cagccggtga 3 60 

ccaagattac caaccacccc agcaacaagg tcaagagcga cccgcagaag gcggtggacc 420 

agccgcgcca gctcttctgg gagaagaagc tgagcggcct gaacgccttc gacattgctg 4 80 

aggagctggt caagaccatg gacctcccca agggcctgca gggggtggga cctggctgca 540 

cggatgagac gctgctgtcg gccatcgcca gcgccctgca cactagcacc atgcccatca 600 

cgggacagct ctcggccgcc gtggagaaga accccggcgt atggctcaac accacgcagc 660 

ccctgtgcaa agccttcatg gtgaccgacg aggacatcag gaagcaggaa gagctggtgc 720 

agcaggtgcg gaagcggctg gaggaggcgc tgatggccga catgctggcg cacgtggagg 7 80 

agctggcccg tgacggggag gcgccgctgg acaaggcctg cgctgaggac gacgacgagg 84 0 

aagacgagga ggaggaggag gaggagcccg acccggaccc ggagatggag cacgtctagg 900 

gcagaggccc tgccgagagc ccgtgctgcc tgctggagcc gcctgcagac gcggtcctcg 960 

gccccacgtg aaccaggctc ggcggcgaag cccagccttg gagacaccca ggaggaaggc 1020 

cgtgctcctg gctccctcct cggcccgtcc ccacttcccg gggcctcggg gcacacagct 1080 

ggggctgccc ccacccgaaa gaccctccac gctcgtcctc tacagagtcc ggcttcggga 1140 

agtgccgggt gctcctgggc cctgcctggc tccctacgac ctttgggctc gaggccagct 1200 

cctccccatg cccgctgtcc cagctccttg agactggaga gcagccagca ggtgcccggc 1260 

agctcggcgc cacggcttgc tgacagctgg gagggtttct cggtctggag gcgtagtttt 1320 

gaaactcaca tcacccactg tgcagcgtga ggacgggact ctggtctgct gtggggggca 1380 

tgcaggacgg cgccactctc tgccctgcca tgcggctggt ggtgccacag agcctcaccg 1440 

tgcctgagtg gcgtgcccag ggaggccgct ctccttcagt aaatgtaaca cagtcgaggc 1500 

acgtcatcgg gcagccttcc ctgtgtgcca acgccagcct tcgcttctga aaaccaaact 1560 

ccagccgctg ccagtcggga cttggtcgcc cggcgctgcc agaatgctcc actgccagcc 1620 

ggcccccctg cctcggtttc ccttctgttt agtggcgaca caggcaccca gctttggggt 1680 

ggtgctgacg ctcccagggg tgccaggagc cactgggaca gggtgaggct cccagacgct 1740 

cctcgaggtg cccagctctc cagggagctt ctggcccaag gcgttcttga gggatctgct 1800 

ccttaacccc ccagtgcctt ggcgagggca ggttccaagc cacagacgcc tgccccgagt 1860 

ggactttgcg gccagtccct gggtgccttc ctgggccctg cttgcccagt gagggttcct 1920 

aacgggtggg ttcawtggcc tggcccvagc gagcccccac ctgcattgac cttaggccca 1980 

tagagagggc ctgtcccggt gctgccccag ccaaggatct ggtcgctgcc ccagggggac 2040 

tgatgggcaa gagtcgcccc tgtggctgga ctgtgaccat ccctgatggg gcctgaccgc 2100 

gggagctgag gaagcgccgc tccaccgtct gccctccaag gacccgcatg gaggcagtgg 2160 
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gctggcagct tcctgctgct ccctgtrcaga gtcaaagcac aaatcctcag gacgggctca 2220 

a 999 cca 999 cagccgaggg aagctccagg tggggaccac gtcttcctga ggttggtgcc 2280 

cactggctgg gaccctttgc agtggggtgg cctcccctct gtctgcctgg tggagggagc 2340 

cgtgggcgtg gggacgtgac tgaataaagc caccatgggt ggatgtgctt gg 23 92 

<210> 8 
<211> 285 
<212> PRT 
<213> Unknown 



<400> 8 



Met 


Glu 


Arg 


Lys 


Arg 


Trp 


Glu 


Cys 


Pro 


Ala 


Leu 


Pro 


Gin 


Gly 


Trp 


Glu 


1 








5 










1 0 










15 




Arg 


Glu 


Glu 


Val 


Pro 


Arg 


Arg 


Ser 


Gly 


Leu 


Ser 


Ala 


Gly 


Hi s 


Arg 


Asp 








2 0 










2 5 










30 






Val 


Phe 


Tyr 


Tyr 


Ser 


Pro 


Ser 


Gly 


Lys 


Lys 


Phe 


Arg 


Ser 


T" 

Lys 


Pro 


Gin 






35 










4 0 










4 5 








Leu 


Ala 


Arg 


Tyr 


Leu 


Gly 


Gly 


Ser 


Met 


Asp 


Leu 


Ser 


Tnr 


Phe 


Asp 


Phe 




5 0 










55 










60 










Arg 


Txir 


Gly 


Lys 


Met 


Leu 


net. 


Asn 


Lys 


rlcl 


~T\ t~* -w^» 

Asn 


Lys 


ber 


Arg 


Gin 


Arg 


65 










7 0 










7 5 










8 0 


Val 


Arg 


Tyr 


Asp 


Ser 


Ser 


Asn 


Gin 


val 


Lys 


Gly 


Lys 


Pro 


Asp 


Leu 


Asn 










85 










90 










95 




Thr 


Ala 


Leu 


Pro 


Val 


Arg 


Gin 


Thr 


Ala 


Ser 


He 


Phe 


Lys 


Gin 


Pro 


Val 








100 










105 










110 






Thr 


Lys 


Tie 


Thr 


Asn 


His 


Pro 


Ser 


Asn 


Lys 


Val 


Lys 


Ser 


Asp 


Pro 


Gin 






115 










120 










125 








Lys 


Ala 


Val 


Asp 


Gin 


Pro 


Arg 


Gin 


Leu 


Phe 


Trp 


Glu 


Lys 


Lys 


Leu 


Ser 




130 










135 










140 










Gly 


Leu 


Ser 


Ala 


Phe 


Asp 


lie 


Ala 


Glu 


Glu 


Leu 


Val 


Arg 


Thr 


Met 


Asp 


145 










150 










155 










160 


Leu 


Pro 


Lys 


Gly 


Leu 


Gin 


Gly 


Val 


Gly 


Pro 


Gly 


Cys 


Thr 


Asp 


Glu 


Thr 










165 










170 










175 




Leu 


Leu 


Ser 


Ala 


lie 


Ala 


Ser 


Ala 


Leu 


His 


Thr 


Ser 


Thr 


Leu 


Pro 


He 








180 










185 










190 






Thr 


Gly 


Gin 


Leu 


Ser 


Ala 


Ala 


Val 


Glu 


Lys 


Asn 


Pro 


Gly 


Val 


Trp 


Leu 






195 










200 










205 








Asn 


Thr 


Ala 


Gin 


Pro 


Leu 


Cys 


Lys 


Ala 


Phe 


Met 


Val 


Thr 


Asp 


Asp 


Asp 




210 










215 










220 










lie 


Arg 


Lys 


Gin 


Glu 


Glu 


Leu 


Val 


Gin 


Gin 


Val 


Arg 


Lys 


Arg 


Leu 


Glu 


225 










230 










235 










240 


Glu 


Ala 


Leu 


Met 


Ala 


Asp 


Met 


Leu 


Ala 


His 


Val 


Glu 


Glu 


Leu 


Ala 


Arg 










245 










250 










255 




Asp 


Gly 


Glu 


Ala 


Pro 


Leu 


Asp 


Lys 


Ala 


Cys 


Ala 


Glu 


Glu 


Glu 


Glu 


Glu 








260 










265 










270 






Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Glu 


Pro 


Glu 


Pro 


Glu 


Arg 


Val 












275 










280 










285 









<210> 9 
<211> 17 
<212> DNA 
<213> Unknown 



<400> 9 
ctggcaagag cgatgtc 



17 
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<210> 10 
<211> 22 
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